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Title: Expression of a l pha-macroglobullns 

FIELD OF THE INVENTION 

The present invention relates to the expression of a-macroglobu- 
5 Tins, derivatives and variants thereof, and especially the expression of the 
human a 2 -macroglobulin (ar 2 M) in an active form in mammalian cells, and the 
expression of genetically engineered variants thereof. The use of such 
recombinant a-macroglobulins, especially recombinant a 2 M(ra; 2 M) and variants 
is described with examples from the fields of medicine for therapeutic 
10 purposes, and the development of novel defined growth media for propagation 
of mammalian cells in culture. 

BACKGROUND OF THE INVENTION. 

BIOCHEMISTRY OF a.-MACROGLOBULIN {tt,Mh 

15 The proteinase binding glycoprotein a z M, which is synthesized in 

the liver, constitute together with the complement proteins C3, C4 and C5 a 
separate class of structurally and functionally related large plasma 
proteins. For a recent review see (Sottrup-Jensen, L. (1987) in: The Plasma 
Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, 

20 FL). 

Apart from C5 these proteins contain an internal B-cysteinyl- 
7-glutamyl thiol ester, which enables the proteolytically activated forms 
of a 2 M, C3, and C4 to participate in characteristic covalent binding reactions 
(Sottrup-Jensen, L., et al., (1980) FEBS Lett. 121: 275-280; Salvesen, G.S. 

25and Barrett, A.J., (1981) Biochem. J. 187: 695-701). The thiol ester 
structure, which in the active proteins can be slowly cleaved by a number of 
small nitrogen nucleophiles, constitutes a unique type of postsynthetic 
modification of proteins, and plays a prominent role in the biological 
properties of a 2 M. The presence of the active thiol esters in or 2 M is revealed 

30 by a characteristic pattern of heat fragmentation (Harpel , P.C., et al., 
(1979) J. Biol. Chem. 254: 8859-8878). 

Traditionally, a 2 M has been studied within the context of plasma 
proteinase inhibitors, although by several criteria it is unique. Whereas 
most plasma proteinase inhibitors are monomeric proteins of roughly similar 

35 size, containing approximately 430-500 residues, a^i is a tetramer whose 180- 
kD subunits contain 1451 residues (Sottrup-Jensen et al . , (1984) J. Biol. 
Chem. 259: 8318-8327). 

Furthermore, in contrast to most other proteinase inhibitors, 
which form 1:1 complexes with serine proteinases engaging the active site 
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of the proteinase and the reactive site of the inhibitor, a 2 M forms complexes 
with a broad spectrum of proteinases differing in their substrate specifi- 
city and catalytic mechanism e.g.: trypsin, leucocyte elastase, chymotrypsin, 
pancreatic elastase, cathepsin G, plasmin, plasma kallikrein and thrombin. 
5 The second-order rate constant for association between these 

proteinases and ajt varies by several orders of magnitude. Both 1:1 and 2:1 
proteinase-a^ complexes can be formed, and the disulf ide-bridged dimer (360 
kD) appears to be the functional unit of a 2 M (Sottrup- Jensen, L. (1987) in: 
The Plasma Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, 
10 Orlando, FL) . Contrary to "classical" proteinase inhibitor complexes the or^ 
bound proteinase is still active, especially toward small synthetic 
substrates (Sottrup- Jensen, L. (1987) in: "The Plasma Proteins" (Putnam, 
F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL). 

The mechanism of proteinase binding by a 2 M has been described by 
15 the "trap" (Barrett, A.J. and Starkey, P.M. (1973) Biochem. J. 133: 709- 
724), where proteolytic cleavage of a particularly exposed peptide stretch 
near the middle of the 180-kD subunit (the "bait" region) results in a 
conformational change of the a 2 M tetramer, thereby entrapping the proteina- 
se. The nature of the essentially irreversible proteinase complex formation 
20 with a 2 M has long remained elusive. However, recent investigations show that 
a major fraction (typically > 80-90 % of the trapped proteinase is also cova- 
lently bound through epsilon-lysyl (proteinase) -7-glutamyl (or 2 M) bonds (Sottrup- 
Jensen, L. et al., (1981) FEBS Lett. 128: 127-132; Sand, O. et al . , (1985) 
J. Biol. Chem. 260: 15723-15735; Pochon, F. et al . , (1987) FEBS Lett. 217: 
25 101-105). 

PHYSIOLOGICAL ASPECTS OF PROTEINASE-^ INTERACTIONS. 

Since the a^-proteinase complexes are rapidly cleared from the 
circulation (Ohlsson, K. (1971) Acta Physiol. Scand. 81: 269-272; Imber, 
30M.J. and Pizzo, S.V. (1981) J. Biol. Chem. 256: 8134-8139.) a general role 
as a "clearing vehicle" for plasma proteinases has been envisaged. 

The main physiological targets may include proteinases of the 
coagulation and fibrinolysis systems and plasma kallikrein, and perhaps also 
proteinases like leucocyte elastase, cathepsin G and collagenases and other 
35 proteinases released during cellular turnover (Sottrup-Jensen, L. and 
Birkedal -Hansen, H. (1989) J. Biol. Chem. 264: 393-401). 

Although ajfi may be largely confined to the vasculature in healthy 
uninflamed tissues, the inhibitor and its proteinase complexes are found at 
near plasma levels in inflammatory exudates of rheumatoid joints and gingival 
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crevicular fluids (Tollefsen, T. and Saltved, E. (1980) J. Periodont. Res. 
15: 96-106; Borth, W., et al . , (1983) Ann. N. Y. Acad. Sci. 421: 377-381). 

While plasma cr 2 M appear to be synthesized in the liver (Schreiber, 
G. (1987) in: "The Plasma Proteins" (Putnam, F.W., ed) 2nd Ed., 5: 294-363, 
5 Academic Press, Orlando, FL.) other sites of synthesis exist. Several cell 
strains in culture have been shown to produce a 2 M including fibroblasts 
(Mosher, D.F., et al . , (1977) J. Clin. Invest. 60: 1036-1045) and monocytes- 
/macrophages (Hovi, T., et al . , (1977) J. Exp. Med. 145: 1580-1589). 

Whereas hepatocytes and Kupffer cells of the liver are most 

10 important for clearance of a 2 M-proteinase complexes in plasma (Davidsen, O., 
et al., (1985) Biochim. Biophys. Acta 846: 85-92), fibroblasts (Van Leuven, 
F., et al., (1979) J. Biol. Chem. 254: 5155-5160; Mosher, D.F. and Vaheri, 
A. (1980) Biochim. Biophys. Acta 62Z: 113-122) and macrophages (Debanne, 
M.T., et al., (1975) Biochim. Biophys. Acta 411: 295-304; Kaplan, J. and 

15Nielsen, M.L. (1979) J. Biol. Chem. 254: 7323-7328) also possess receptors 
for a 2 M-proteinase complexes. 

These observations suggest that there may be a considerable 
extravascular turnover of a 2 M perhaps primarily carrying proteinases 
functioning in the cellular micro environment (Sottrup-Jensen, L. and 

20 Birkedal -Hansen, H. (1989) J. Biol. Chem. 264: 393-401). 

SUMMARY OF THE INVENTION 

Briefly stated, the present invention discloses a method for the 
production of recombinant a-macroglobul ins, and especially human or 2 M, and 
25 variants thereof in an active form. 

Within a preferred embodiment, the cultured host cell is an 
eukaryotic cell such as a mammalian cell or cells derived from organisms 
such as insects, plants, yeast or other fungi, such as Aspergillus . 

The invention further relates to DNA sequences comprising a gene 
30 encoding for the expression of human a 2 M and variants thereof, vectors 
comprising such DNA sequences, and suitable hosts transformed with such 
vectors. 

Yet another aspect of the invention is the use of recombinant 
a 2 M and variants thereof as a protein carrier in enzyme replacement therapy 
35 (ERT). 

Yet another aspect of the invention is the use of recombinant 
a 2 M and variants thereof as a DNA carrier in gene therapy. 

Further aspects of the invention relates to the use of recom- 
binant cr-macrogl obul ins, especially human a 2 M, and variants thereof as 
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constituents of growth media, either as an additive or co-expressed with a 
desired gene product. 

DEFINITIONS 

5 Prior to setting forth the invention it may be helpful for an 

understanding thereof to set forth definitions of certain terms to be used 
hereafter. 

Complementary DNA or cDNA: A DNA molecule or sequence which have been 
10 enzymatically synthesized from sequences present in a mRNA template. 

DNA Construct: A DNA molecule, or a clone of such a molecule, either single- 
or double-stranded, which may be isolated in partial form from a naturally 
occurring gene or which has been modified to contain segments of DNA which 
15 are combined and juxtaposed in a manner which would not otherwise exist in 
nature. 

Plasmid or Vector: A DNA construct containing genetic information which may 
provide for its replication when inserted into a host cell. A plasmid 
20 generally contains at least one gene sequence to be expressed in the host 
cell, as well as sequences encoding functions which facilitate such gene 
expression, including promoters and transcription initiation sites. It may 
be a linear or closed circular molecule. 

25 Joined: DNA sequences are said to be joined when the 5' and 3' ends of one 
sequence are attached by phosphodiester bonds to the 3' and 5' ends, 
respectively, of an adjacent sequence. Joining may be achieved by such 
methods as ligation of blunt or cohesive termini, by synthesis of joined 
sequences through cDNA cloning, or by removal of intervening sequences 

30 through a process of directed mutagenesis. 

Variant: A peptide related to the original peptide, but wherein the amino 
acid sequence has been altered through mutation of the gene encoding the 
original peptide. 

35 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure la illustrates the construction of plasmid pi 136 . 

Figure lb illustrates the construction of plasmid pi 167 . 
35 Figure 2 illustrates the structure of plasmid pi 167 . 

Figure 3 illustrates a gel electrophoresis (10 - 20 % SDS-PAGE) 
of the thermal fragmentation products generated from a 2 M and rajft. 

Figure 4 illustrates a gel electrophoresis of the thermal 
fragmentation products generated from methyl amine treated a 2 M and ra 2 M. 
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Figure 5 illustrates a gel electrophoresis (SDS-PAGE) of the 
reaction products generated from trypsin treatment of or 2 M and ror 2 M. 

Figure 6 illustrates a gel electrophoresis of the reaction produ- 
cts generated from trypsin treatment of methyl ami ne-treated o^M and rafl. 
5 Figure 7 illustrates a "rate gel" electrophoresis of unreacted 

native -and trypsin treated otjft and rajft. 

Figure 8 illustrates a "rate gel n electrophoresis of unreacted 

native -and methylamine treated aJA and ra^. 

Figure 9 illustrates the chromatograms of or 2 M and ra 2 M on a 

lOSuperose 6 column. 

Figure 10 illustrates the gel electrophoresis (10 - 20 % reducing 
SDS-PAGE) of the reaction products from chymotrypsin treated human a 2 M, human 

PZP and ra 2 M-PZP. 

Figure 11 illustrates the gel electrophoresis (10 - 20 % reducing 
15 SDS-PAGE) of the reaction products from elastase treated human afl, human 

PZP and ra^-PZP. 

Figure 12 illustrates the gel electrophoresis (10 - 20 % reducing 
SDS-PAGE) of the reaction products from trypsin treated human a 2 M, human PZP 
and ra 2 M-PZP. 

20 Figure 13 illustrates the gel electrophoresis (10 - 20 % reducing 

SDS-PAGE) of the reaction products from Staphylococ cus aureus Glu-specific 
protease treated human a 2 M, human PZP and ra 2 M-PZP. 



25 DETAILED DESCRIPTION OF THE INVENTION 

According to the invention there is provided a process for the 
production of a-macroglobul ins, especially human a 2 -macroglobul in, or 
fragments or derivatives, including variants thereof, wherein a functionally 
operative expression vector comprising a gene encoding for the expression of 

30 a a-macroglobulin, especially human ot 2 -macroglobu1 in, or fragments or 
derivatives thereof, including variants, or alleles of such a gene, is intro- 
duced into a suitable host capable of expressing said gene, said host is 
cultured in a suitable nutrient medium containing sources of assimilable 
carbon and nitrogen and other essential nutrients, and the expressed a- 

35macroglobulin, especially human a 2 -macroglobul in, or fragments or derivatives 
thereof is recovered. 

Many proteins synthesized particularly in mammalian cells undergo 
post-translational modification (processing) of one kind or the other. 
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Depending on the final destination and on the specific function of a newly 
synthesized protein, it may go through a number of processing steps leading 
to covalent modifications such as e.g.: glycosyl ation, -y-carboxyl ation, B- 
hydroxyl ation, sulphatation, amidation, thiol ester formation, phosphory- 

Slation, proteolytic cleavage at precursor processing sites, fatty acylation 
(Rosner, M.R. (1986). in: "Mammalian Cell Technology", (Thilly, W.G. ed), 
Butterworth Publishers, Stoneham, MA.: 63-89). 

Proteins of various sizes and with a variety of different post- 
-translational modifications have been successfully expressed in transformed 

10 heterologous mammalian host cells using recombinant DNA technology. A few 
examples: Human coagulation factors Vila and IX have been expressed in trans- 
formed BHK (Syrian Baby Hamster Kidney) cells with correct post-transl ational 
modifications such as 7-carboxyl ation and glycosylation (Thim, L. et al . , 
(1988) Biochemistry 27: 7785-7793; Busby, S. et al . , (1985) Nature 316: 271- 

15 273). Human Platelet-derived Growth Factor AB heterodimer has been expres- 
sed in transformed CHO (Chinese Hamster Ovary) cells with correct processing 
of the A and B chain precursors and correct assembly of the AB heterodimer. 
Human coagulation factor VIII has been expressed in transformed CHO cells 
with correct processing of the precursor leading to a two chain molecule that 

20 can be activated by thrombin and factor Xa (Kaufman, R.J. et al . , (1988) J. 
Biol. Chem. 263: 6352-6362; Pittman, D.D. and Kaufman, R.J. (1988) Proc. 
Natl. Acad. Sci . USA 85: 2429-2433). 

So far, there have been no reports on the heterologous expression 
of proteins in which the formation of an active thiol ester is a prominent 

25 post-translational modification. 

The biosynthesis of the internal thiol ester in the third com- 
ponent (C3) of complement from rabbit has been investigated (Iijima, M. et 
al., (1984) J. Biochem. 96: 1539-1546). Rabbit liver mRNA was translated in 
vitro in a rabbit reticulocyte lysate system, and the synthesized C3 specific 

30 products did not incorporate radio labelled methylamine. On the other hand 
radio labelled iodoacetamide reacted with the synthesized C3 specific 
products; these results indicated the presence in the primary C3 specific 
translation product of a free thiol group instead of a reactive thiol ester. 
If a liver homogenate supernatant (S- 13) including cytosol and microsomes was 

35 included, the C3 specific product could now incorporate methylamine. By 
increasing the concentration of the S- 13 component(s) , the incorporation of 
methylamine in C3 specific products was increased, and at the same time 
incorporation of iodoacetamide decreased. If the S-13 fraction was treated 
at 65°C for 5 min, the activity was completely lost. 
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The results from this investigation strongly suggest an involve- 
ment of a transglutaminase-! ike or other type, of enzyme in the posttransla- 
tional formation of an active thiol ester in rabbit C3. There are no similar 
investigations addressing the formation of the thiol ester in other a-macro- 
Bglobulins, e.g. a^, but from analogy and homology considerations, it is 
expected that a similar mechanism is responsible for the formation of thiol 
esters in other a-macroglobul ins synthesized in the mammalian liver. 

Through this investigation a number of developments were done 
10 which also are deemed to be encompassed of the present invention. These 
include DNA sequences comprising a gene encoding for the expression of a- 
macroglobulins, especially human a 2 -macroglobul in, or fragments or deriva- 
tives and variants thereof as exemplified in SEQ ID N0:1 and SEQ ID N0:3. 

Another aspect of the invention relates to functionally operative 
15 expression vectors comprising a gene encoding for the expression of at least 
one a-macroglobul in, especially human a 2 -macroglobul in or fragments or 
derivatives and variants thereof, or alleles of such a gene. 

Such vectors preferably further comprise regulatory elements 
necessary for the stable maintenance of said vector in mammalian cells. 
20 Also, such vectors may further include sequences providing for 

the processing and secretion of the expressed product. 

In relation to the use of recombinant a-macroglobul ins, and 
especially ra 2 M, in growth media it may be co-expressed with another desired 
gene product, and consequently the vectors of the invention may further 
25 comprise one or more other genes encoding for a desired gene product. 

The invention further relates to transformed hosts comprising a 
functionally operative expression vector according to the invention compri- 
sing a gene encoding for the expression of human a 2 -macroglobul in or fragments 

30 or derivatives and variants thereof, or alleles of such a gene. 

The host may be selected from the group comprising a bacterial 
strain, a fungal strain, a mammalian cell line, or a mammal, especially a 
fungus, such as belonging to the genus Aspergillus , or a yeast strain, pre- 
ferably belonging to the genus Saccharomvces . 

35 Another preferred type of host is a mammalian cell line, 

preferably a Syrian Baby Hamster Kidney (BHK) cell line, and especially the 
one which is available from ATCC under No. CRL 1632. 
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The invention further relates to the recombinant human a 2 - 
macrogl obul in or a variant thereof in an active form having the amino acid 
sequence of SEQ ID N0:2, or SEQ ID N0:4. 

5 APPLICATIONS OF a-MACROGLOBULINS « ESPECIALLY raJI. 

The present invention discloses applications of a-macrogl obul ins, 
and especially ra 2 M. These should be regarded not as limitations but as a 
few examples among many for the use of recombinant derived a-macrogl obul ins. 

10 a-MACROGLOBULINS AS CONSTITUENTS OF DEFINED GROWTH MEDIA. 

Degradation of specific heterologous products produced in either 
transformed or non-transformed mammalian cells is a potential problem in the 
production of recombinant products. This is due to the fact that many host 
cells secretes one or more different proteinases. 

15 When a production cell line is grown in the presence of e.g. 10 

% fetal calf serum, such proteolytic degradation of secreted recombinant or 
native protein products is a minor problem due to a buffering effect of the 
added serum proteins. 

However, the use of fetal calf serum in the large scale growth 

20 (fermentation) of mammalian production cell lines is not a desirable 
situation for a number of reasons. First of all fetal calf serum is a very 
costly constituent of complex growth media; second, the demand for fetal 
calf serum from a growing biopharmaceutical industry might not be easily 
fulfilled in the future, and third, the use of fetal calf serum constitutes 

25a potential quality control problem in the production of pharmaceuticals 
intended for use in humans. 

To circumvent these problems, efforts can be expected in the 
field of development of defined growth media for use with mammalian cells. 

Addition of various proteinase inhibitors to such new defined 

30 growth media will be required to ensure the integrity of the secreted 
products. Alternatively, the producer cell line might, through genetic 
engineering, be endowed with the capacity to produce and secrete proteinase 
inhibitors along with the desired product(s). 

a-Macroglobul ins, and especially Human a^, are proteinase 

35 inhibitors of broad specificity, and they are therefore according to the 
invention used as constituents of defined growth media for mammalian cells, 
either as a medium additive or as a product co-produced with the desired 
product. 
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The target sites for a number of different proteinases, e.g. 
bovine trypsin, Streptomvces griseus trypsin, papain, porcine elastase, 
bovine chymosin, bovine chymotrypsin, Staphyloco ccus aureus strain V8 
proteinase, human plasmin, bovine thrombin, thermolysin, subtilisin Novo and 

5 Streptomvces griseus proteinase B have been mapped in the bait region of 
human a 2 M (Mortensen, S.B., et al . , (1981) FEBS Lett. 135: 295-300) and other 
a-macroglobulins (Sottrup- Jensen, L., Sand, 0., Kristensen, L. and Fey, G.H. 
J.Biol .Chem. 264 .15781-15789, 1989). It is evident that ajft and the other a- 
macroglobulins as proteinase inhibitors have broad specificities. 

10 In those situations, where the proteinase inhibitory spectrum 

of a a-macroglobul in, such as a 2 M, is not sufficient for the prevention of 
product degradation, it is possible through site specific mutation, protein 
engineering, etc. to change the proteinase inhibitor specificity of the a- 
macroglobulin, such as ajft. Incorporation of desirable specific proteinase 

15 target sites in the bait region of recombinant a 2 M will change the inhibitor 
specificity of the mutated a 2 M. Furthermore it is possible through genetic 
engineering to construct novel specific or general proteinase target sites 
in the bait region of a a-macroglobul in in order to enhance its versatility 
as a proteinase inhibitor of specific or broad inhibitory spectrum. 

20 Furthermore it is possible to remove specific target sites in an a- 
macroglobul in in order to avoid degradation of the variant in question by 
certain proteases in the circulation that will already be inhibited through 
the action of naturally present proteinase inhibitors. 

The production of recombinant products in fungi, such as species 

25 and strains of e.g. Aspergillus and Saccharomvces also meets with potential 
problems of product degradation. In some cases it is possible to isolate 
proteinase negative mutants of desirable production strains. This might not 
always be the case, and co-expression of a-macroglobulins, such as a 2 M or 
a 2 M-mutants together with a desirable product may inhibit proteolysis of the 

30 product in question. 

a-MACROGLOBULIN MUTANTS AS SPECIFIC PROTEINASE INHIBITORS. 

The amino acid sequence of the bait region of a-macroglobulins 
defines the specificity of the a-macroglobul in towards different proteina- 
35ses. A comparison of cleavage patterns for different proteinases and bait 
region sequences in five mammalian a-macroglobulins has recently been 
published (Sottrup-Jensen, L., Sand, 0., Kristensen, L. and Fey, G.H. The 
a-macroglobul in bait region. Sequence diversity and localization of cleavage 
sites for proteinases in five mammalian a-macroglobulins. J. Biol . Chem. 264, 
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15781-15789, 1989). It has previously been clearly demonstrated that the bait 
region in each species of or-macroglobul in is the major determinant of 
proteinase inhibitor specificity. The present invention demonstrates the 
possibility of modulating the inhibitor specificity of human ajfi by 
5 alterations of proteinase target sites in the bait region. 

In the present invention it is demonstrated that the bait region 
of human ajfi (residues 690 to 730 in SEQ ID N0:2) can be mutated at will to 
obtain a new proteinase inhibitor profile of this macroglobulin. The example 
presented in the present invention describes the construction of a hybrid 

10 macroglobul in. In this hybrid the bait region from human pregnancy zone 
protein (PZP) was introduced into human ajft, from which the native bait region 
had been removed. The hybrid molecule, which was constructed by the use of 
recombinant DNA technology, revealed a proteinase inhibitor profile similar 
to the inhibitor profile of PZP. 

15 The invention thus demonstrates the possibility to design and 

produce proteinase inhibitors with altered and new inhibitor specificities 
at will . 

This finding is important for the design of new proteinase 
inhibitors. Due to the low antigenicity the bait region in macroglobulins 

20 (Van Leuven, F., Marynen, P., Cassiman, J. -J. and Van den Berghe, H. Mapping 
of structure-function relationships in proteins with a panel of monoclonal 
antibodies. A study on human alpha-2-macroglobulin. J. Immunol . Methods 111 , 
39-49, 1988, and Delain, E. , Barray, M. , Tapon-Bretaudiere, J., Pochon, F. , 
Marynen, P., Cassiman, J. -J., Van den Berghe, H. and Van Leuven, F. The 

25Molecular Organization of Human alpha2-Macroglobul in . An Immunoelectron 
microscopic study with monoclonal antibodies. J. Biol . Chem. 263 , 2981-2989, 
1988) it is now possible, by the use of the technology described in the 
present invention, to design non-immunogenic new proteinase inhibitors that 
can be used e.g. in the treatment of any disease, where aggressive proteina- 

30 ses constitute a threat to the health of man. 

In the present specification the production of a 2 M variants is 
described by the construction of a hybrid macroglobulin. It is clear to the 
skilled person in the art that changes also could be obtained through other 
genetic engineering methods, such as described in International Publication 

35 No. W0 89/06279 (NOVO INDUSTRI A/S). Also it is clear that other a- 
macroglobul ins could be employed instead of the human a 2 M, such as those 
mentioned in Sottrup-Jensen, L. et al . (1989), supra . 
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rftJJ AS A PROTEIN CARRIER IN ENZYME REPLACEMENT THERAPY, 

A different application of is .its use as a carrier of macro- 
molecules such as proteins and nucleic acids. When or^ reacts with and forms 
a complex with a proteinase in solution, or 2 M may bind other proteins (also 

5 non-proteinase proteins) present in that solution (Salvesen, G.S. et al . , 
(1981) Biochem. J. 195: 453-461). In the case of Fabry's disease, which is 
an X-chromosome linked disorder of glycosphingol ipid metabolism, it has 
recently been demonstrated that oc 2 M can function as a carrier in an in vitro 
model of enzyme replacement therapy (ERT) (Osada, T., et al., (1987) Biochem. 

lOBiophys. Res. Commu. 142: 100-106). otjft was conjugated to coffee bean a- 
galactosidase through the action of trypsin, and the formed complex was 
internalized through a 2 M-receptor specific (Van Leuven, F., et al., (1981) J. 
Biol. Chem. 256: 9016-9022) endocytosis and delivered to the lysosomes, which 
is the target organelle for a^-receptor mediated internalization of a 2 M- 

15proteinase complexes (Willingham, M.C. and Pastan, I., (1980) Cell 21: 67- 
77). 

Such a scheme in ERT provides a method of internalization to the 
lysosome of the enzyme in question and at the same time it might alleviate 
potential antigenicity problems arising from the use of heterologous enzymes 

20 in therapy. One limitation in this type of ERT (Osada, T., et al., (1987) 
Biochem. Biophys. Res. Commu. 142: 100-106) would be the types of potential 
target cells that could be treated by this protocol. Obviously, they would 
have to express the a 2 M-receptor. In a future development of the system, the 
possibility might exist to redesign the cell specificity of a 2 M internal iza- 

25tion by exchanging the receptor binding domain of cr 2 M with other receptor 
ligands. Hereby a^-mutants could be designed to enter any cell type known 
to express a specific internal izable receptor. 

This type of development would of course require a system for 
the production of recombinant derived o^M. The use of native human a 2 M as a 

30 carrier in ERT (as described above) is undesirable due to the now well known 
risks of the employment of blood derived products in the treatment of human 
disease. 

The production of recombinant a 2 M in accordance with the present 
invention alleviates this problem by providing for large scale production 
35 of ra^. 

rft.M AS A DNA CARRIER IN GENE THERAPY. 

Advances in gene transfer into mammalian cells have opened for 
the possibility of the treatment of a number of genetic disorders through 
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gene therapy. A major problem in gene therapy will be the specific targeting 
of genes into the appropriate cells within the body. (Williamson, B., (1982) 
Nature 298: 416-418; Anderson, W.F., (1984) Science 226: 401-409; Parkman, 
R., (1986) Science 232: 1373-1378). 
5 It was recently described that a constructed foreign gene 

containing the chloramphenicol acetyl transferase (CAT) on a bacterial plasmid 
could be targeted to the liver of rats by specific receptor directed 
internalization (Wu, G.Y. and Wu, C.H. (1988) J. Biol. Chem. 263: 14621- 
14624). The DNA carrier consisted of a galactose-terminal (asialo)glyco- 

10 protein and asialoorosomucoid covalently linked to poly-L-lysine. The 
polycation poly-L-lysine can bind DNA in a strong non-covalent and nondamag- 
ing interaction. It was demonstrated that complex bound DNA was internalized 
by cell -surface asialoglycoprotein receptors that are unique to hepatocytes. 
The complex was injected intravenously, and upon analysis only the liver 

15 expressed the CAT activity. 

In the present invention the use of ra^ as a carrier of DNA in 
gene therapy is suggested. Reaction of ra 2 M with a proteinase such as trypsin 
or with methylamine in the presence of covalently closed circular plasmid DNA 
is likely to result in partial or total entrapment of DNA within the 

20complexing a 2 M molecule. After intravenous injection of such complexes with 
exposed receptor binding domains, the complex will be rapidly cleared from 
the blood and internalized in specific target cells, such as hepatocytes and 
Kupffer cells. Through protein engineering on the receptor binding domain of 
ra 2 M it will be possible to design a DNA carrier specific for other cell 

25 types. The advantage in this system as compared to the above described system 
using the asialoglycoprotein receptor is, that it will not be necessary to 
identify different DNA carrier systems for each new cell type. 

30 EXAMPLES 

Materials and methods: 
Microorganisms and cell lines 

E. col i K12 (MC1061) is available from e.g. Stratagene Inc., 
35 11099 North Torrey Pines Rd., La Jolla, California 92037. 

HepG2 (Human hepatoblastoma cell line) is freely available from 
American Type Culture Collection, under No. HB 8065. 

BHK (Syrian Hamster Kidney cell line, thymidine kinase mutant 
line tk^s^, (Waechter and Baserga (1982) Proc. Natl. Acad. Sci. USA 79: 
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1106-1110); is freely available from American Type Culture Collection, under 
No. CRL 1632. 

Plasmids and vectors 

5 

Plasmids pCDVI-PL and pSP62-K2 are available from Dr. Tasuku 
Honjo, Faculty of Medicine, Kyoto University, Kyoto 606, Japan. pSP62-K2 was 
derived from the plasmid pSP62-PL (available from New England Nuclear/Du 
Pont (U.K. ) Ltd., Wedgwood Way, Stevenage, Hertfordshire, SG14QN) as 

10 described (Noma et al . , (1986) Nature, 319: 640-646). pCDVI-PL was derived 
from pcDVl (Okayama, H. and Berg, P. (1983) Molec. cell. Biol. 3: 280-289) 
as described (Noma et al . , (1986) Nature, 319: 640-646). 

M13mpl8 is available from Pharmacia LKB Biotechnology (catalog 
# 27-1552-01) (Norrander, J., Kempe, T. and Messing, J. Gene 26: 101-106, 

15 1983). 

M13mpl9 is available from e.g. International Biotechnologies, 
Inc., P.O. Box 9558, 275 Winchester Avenue, New Haven, Connecticut 06535, 
USA. 

pDHFR-I is available from Dr. K.L.Berkner, ZymoGenetics Inc., 
20 4225 Roosevelt Way NE, Seattle, Washington 98105. (The construction of this 
plasmid is given in detail in: Berkner, K.L. and Sharp, P. A. (1984) Nucleic 
Acids Res. 12: 1925-1941). The molecular cloning of the DHFR cDNA present 
in this plasmid, and its sub-cloning in mammalian expression vectors under 
the control of adenovirus derived promoters has previously been described 
25 in detail (Chang, A.C.Y., et al . , Nature 275: 617-624 and Kaufman, R.J. and 
Sharp, P. A. (1982) Mol . Cell. Biol. 2: 1304-1319) . The backbone plasmid in 
pDHFR-I is pBR322 (Sutcliffe, J.G. (1979) Cold Spring Harbor Symp. Quant. 
Biol. 43: 77-90; Sutcliffe, J.G. (1978) Nucleic. Acids Res. 5: 2721-2728). 

pUC13 is described in: Vieira, J. and Messing, J.: 1982, Gene 19: 
30 259-268 and available from Pharmacia LKB Biotechnology (catalog # 27-4954- 
01). 

pUC19 is described in: Yanisch-Perron, C. and Messing, J., 1985, 
Gene 33:103-119 and available from Pharmacia LKB Biotechnology (catalog # 
27-4951-01). 

35 
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Growth media 
LB-broth: 

Mix 227 g Bacto Tryptone, Difco 0123-01 

113.5 g Yeast extract, Difco 0127-01, and 
5 227 g NaCI in a sealable plastic container. 

Add 12.5 g mix to 500 ml water in a 1000 ml bottle, shake well and sterilize 
in an autoclave. 

Dulbeccos Modified Eagle Medium is available from e.g. Gibco Ltd. 
10 P.O. Box 35, Trident House, Renfrew Road, Paisley PA34EF, Renfrewshire, 
Scotland. Cat.# 042-250 1M (10 * concentrate). 

Antibodies 

15 Anti-a 2 M A033 and peroxidase conjugated anti-or^ PE326 were from 

DAKOPATTS A/S, Copenhagen, Denmark. 

EXAMPLE 1. 

CLONING AND SEQUENCE DETERMINATION OF HUMAN a„M 

20 

Preparation of messenger RNA from the human cell line HepG2. 

The human hepatoblastoma cell line HepG2 (American Type Culture 
Collection No. HB 8065, freely available) was used as a source for mRNA 
preparation. HepG2 cells were grown to a total cell number of 15 * 10 7 in 
25Dulbecco's Modified Eagle medium containing 10% fetal calf serum and 
antibiotics. 

Total RNA was isolated by the guanidinium thiocyanate method 
(Chirgwin et al., (1979) Biochemistry 18: 5293-5299) and purified by CsCl 
gradient centrifugation. A total of 3000 /ig RNA was obtained. mRNA was 

30 isolated by use of an oligo(dT)-cellulose column (Aviv & Leder (1972) Proc. 
Natl. Acad. Sci . USA 69: 1408-1412). 60 /ig of mRNA was obtained after one 
cycle of affinity chromatography. After ethanol precipitation, this 
preparation of mRNA was resuspended in 10 mM Tris-HCl pH 7.5, 0.1 mM EDTA- 
Na 2 at a final concentration of 1 /ig//il and stored at -80°C for subsequent 

35 use in the construction of a cDNA library. 

Construction of a cDNA library from HepG2 mRNA. 

A cDNA library was constructed in the pCDVI-PL/pSP62-K2 vectors 
(Noma et al . , (1986) Nature, 319: 640-646. Available from Dr. Tasuku Honjo, 
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Faculty of Medicine, Kyoto University, Kyoto 606, Japan) by use of the 
methods described by Okayama & Berg (Mol. Cell. Biol. 2: 161-170 (1982); 
Mol. Cell. Biol. 3: 280-289 (1983)). 

E. coli K12 (MC1061) (Casadaban & Cohen (1980) J. Mol. Biol. 
5 138: 179-207) was used for transformation. MC1061 were grown in L-broth at 
37°C to OD^O.5. Twenty ml were centrifuged, and the pellet was resuspended 
in 7 ml of ice-cold sterile 0.1 M CaCl 2 , incubated on ice for 30 minutes, 
centrifuged briefly, and finally kept in the cold room overnight. 

Ninety-five /il suspension of transformation-competent E. coli 
10MC1061 were added per 10 /-tl of cDNA preparation. The mixture was incubated 
on ice for 30 minutes, heat-shocked at 43,5°C for 45 seconds, and finally, 
after addition of L-broth, incubated at 37'C for 30 minutes. 

After resuspension, the cells were plated onto L-broth plates 
containing ampicillin (50 /xg/ml) and grown for 8 hrs at 37°C. A total of 2.9 
i5*10 5 individual colonies could be obtained from this library. 

Screening of the HepG2 library for cDNA clones encoding human qJI. 

5 * 10* individual colonies were screened by standard colony 
hybridization technique using nitrocellulose filters (Maniatis et al . , (1982) 
20 Molecular Cloning - A Laboratory Manual, Cold Spring Harbor, New York). 

A 20-mer oligonucleotide mixture 

5' CC(T/C)TTCAT(G/A)TC(T/C)TC(T/C)TG(T/C)TT 3' 
where the notation (X/Y) means that either of the nucleic acids X or Y may 
be used, complementary to the human a 2 M mRNA in the region encoding amino 
25 acid residues Lys-Gln-Glu-Asp-Met-Lys-Gly (residues number 493 - 499 in 
Sottrup-Jensen et al . , J. Biol. Chem. 259: 8318-8327 (1984) was synthesized 
(on a DNA synthesizer from Applied Biosystems, USA), labelled with ^P (using 
1 A polynucleotide kinase and y-^P-ATP) to a specific activity of 3 * 10 s 
cpm/pmol oligonucleotide. The labelled oligonucleotides were purified by gel 
30 chromatography and subsequently used in the screening of the cDNA library. 

The hybridization solution contained 6 * SSC, 5 * Denhardt's 
solution, 0.05% SDS (Maniatis et al., (1982) Molecular Cloning - A Laboratory 
Manual, Cold Spring Harbor, New York) and 10 B cpm/ml of labelled oligo- 
nucleotide mix. 

35 Hybridization was performed for 3 hrs at 45°C. Then the filters 

were washed in 6 * SSC, 0.05% SDS at 45°C for 3 * 10 minutes. After- autora- 
diography the filters were washed under the same conditions, but this time 
at 52*C. A colony that still showed hybridization at this temperature was 
isolated and the cDNA insert of the corresponding plasmid (designated po^) 
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from this isolate was sequenced (Tabor & Richardson (1987) Proc. Natl. Acad. 
Sci. USA 84: 4767-4771). The sequence of the cDNA and the derived encoded 
amino acid sequence are shown in the appended sequence listings, SEQ ID 
N0:1:, and SEQ ID N0:2:. 

5 

Characterization of DQfoM. 

por 2 M had a cDNA insert of approximately 4.6 kb. Its sequence is 

given in Table I above. 
10 The sequence in Table I demonstrates that the entire coding 

region of a 2 M including the signal peptide is found in the insert. 

In addition to the coding region, the insert contains sequences 

derived from the 5'- and 3' untranslated regions of the a 2 M mRNA molecule. 

The amino acid sequence of the human a 2 M as deduced from the cDNA 
15 in parjjM is in total agreement with the published sequence (Sottrup-Jensen et 

al., (1984) J. Biol. Chem. 259: 8318-8327). Codon number 1000 (numbered from 

the initiating methionine codon in the signal peptide) was found to be ATC 

encoding an isoleucine and not GTC (encoding a valine) as found in an a 2 M cDNA 

synthesized from human liver mRNA (Kan et al . , (1985) Proc. Natl. Acad. Sci. 
20 USA. 82: 2282-2286). In the aj<l cDNA sequence from the HepG2 library we have 

further identified ten silent changes as compared to the sequence from the 

liver library, see the following Table I: 
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TABLE I 



5 


Codon 


Li ver 






413 (Asn) 


A Af* 

AAL 






495 (Phe) 


TTT 


TTC 


10 


750 (Gly) 


GGb 






796 (Leu) 


LI 1 




15 


Boa ^Lcu; 


TTT 


CTA 




1266 (Ala) 


GCC 


GCA 




1296 (Asn) 


AAT 


AAC 


20 


1326 (Thr) 


ACC 


ACA 




1442 (Leu) 


CTC 


CTG 


25 


1460 (He) 


ATC 


ATT 



The position of the oligonucleotide mixture used as a hybridiza- 
tion probe in the colony screenings was from position 1574 to position 1594, 
30 and the position of the reactive thiol ester is from position 2939 to 2953 
in SEQ ID N0:1. 



EXAMPLE 2. 

Construction of a mammalian expression vector for aji. 

35 pa 2 M was digested (fig. la) with Xbal and EcoRI, and a 1.2 kb 

fragment containing the 5' part of the afl cDNA together with the multiple 
cloning site of pSP62-K2 was isolated on an agarose gel and cloned in an 
Xbal/EcoRI digested M13mpl9 vector to generate M13mpl9A. To facilitate 
further subclonings of the a 2 M cDNA, a unique EcoRV site was introduced in 

40 the 1.2 kb fragment 10 nucleotides 5' to the initiating ATG (methionine) 
codon through site directed mutagenesis (Kunkel et al . , (1987) Methods 
Enzymol. 154: 367-382). In the same mutagenesis experiment, in which the 
mutagenic oligonucleotide N0R593: 

5 ' (TTCTTCCCCATGGTGGATATCGAAGGAGCTG ) 3 ' 

45 was used, the 5 nucleotides 5' to the methionine codon was changed to 
CCACCATG; this mutation creates a new Ncol site spanning the ATG codon. A 
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correct mutant M13mpl9B was identified through restriction enzyme digestion 
and DNA sequencing. 

The mutated 5' end of a 2 M cDNA was isolated from M13mpl9A repli- 
cative form through digestion with HindHI and Eco RI and agarose gel electro- 
5 phoresis. The isolated DNA fragment was then joined to Hi ndl I I/EcoRI digested 
potgM through ligation to generate pi 136 . In this plasmid the a 2 M cDNA is 
reassembled in its total length, but now with a unique EcoRV site at the 5' 
end. pi 136 was digested with EcoRV/Dral. and the o^M fragment was isolated on 
an agarose gel and cloned in a mammalian expression vector under control of 

10 the adenovirus 2 major late promoter (Ad 2 MLP). 

The adenovirus-promoter based vector was constructed by K.L.Berk- 
ner (ZymoGenetics Inc., Seattle, WA.), and a detailed description of the 
functional elements in the mammalian expression vector is given in: Powell, 
J.S. et al., (1986) Proc. Natl. Acad. Sci . USA 83: 6465-6469 and in: Boel 

15 et al., (1987) FEBS Lett. 219: 181-188). 

The expression vector used for expression of human o^M was 
generated from the mammalian expression vector pPP (Boel, E. et al., (1987) 
FEBS Lett. 219: 181-188), in which human pancreatic polypeptide cDNA was 
cloned under control of Ad 2 MLP. 

20 pPP was digested (fig. lb) with BamHI and the resulting stag- 

gered ends were repaired with DNA polymerase (Klenow fragment and the four 
deoxynucleotide triphosphates). The 4.5 kb EcoRV/Dral a 2 M cDNA fragment was 
joined to this vector through ligation, and correct recombinants were 
characterized through restriction enzyme analysis on isolated miniprep. 

25 pi asmids. 

The a 2 M-mRNA transcribed from the resulting 8.76 kb plasmid 
(designated pll67 (fig. 2)) has the adenovirus 2 late tripartite leader (Ll- 
3) at its 5' end together with an mRNA splice signal (SS). At the 3' end of 
the construct the transcript is terminated with the SV40 late termination - 
30 and polyadenylation signal. 5' to the Ad 2 MLP the construct includes the 
SV40 enhancer (ENH) and the 0 to 1 (0 - 1) map units from adenovirus 5. 

Expression of g„M in mammalian cells. 

For expression of human a 2 M in cultured BHK cells (Syrian Hamster 
35 Kidney, thymidine kinase mutant line tk" l sl3, (Waechter and Baserga (1982) 
Proc. Natl. Acad. Sci. USA 79: 1106-1110); American Type Culture Collection 
CRL 1632) the expression vector pi 167 was co-transfected with pDHFR-I (Berk- 
ner, K.L. and Sharp, P. A. (1984) Nucleic Acids Res. 12: 1925-1941. Available 
from K.L.Berkner, ZymoGenetics Inc. Seattle) into subconfluent cells by the 
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calcium phosphate mediated transfection procedure (Graham and Van der Eb 
(1973) Virology 52: 456-467). In the transfect.ion experiment the molar ratio 
between pll67 and pDHFR-I was 10:1. Cells were grown in Dulbeccos Modified 
Eagle Medium supplemented with 10% fetal calf serum (FCS). 

5 Forty-eight hours after transfection, cells were trypsinized and 

diluted into medium containing 400 nM methotrexate (MTX). After 10 to 12 
days, individual colonies were cloned out and expanded separately. The 
expanded cultures were propagated for 24 hours as described above, and 
producer clones were identified using an enzyme linked immunosorbent assays 

10(ELISA) (Munck Petersen C, et al., (1985) Scand. J. Clin. Lab. Invest. 45: 
735-740) against human cr 2 M secreted to the growth medium. 

Description of the or ? M ELISA assay. 

The materials used in the ELISA were: 
15 Catching antibody A033 anti-a^, 

Peroxidase-conjugated anti-a 2 M antibody PE326, 

1,2-Phenylenediamine, dihydrochloride (0PD) 

all from DAKOPATTS A/S, Copenhagen, Denmark. 

Urea peroxide, 125 mg, was from Organon Teknika. 
20 96 well ELISA plates were from NUNC, Copenhagen. 

Coating buffer: 

100 mM carbonate buffer pH 9.6 was made up as follows: 
Add 3.18 g Na 2 C0 3 and 5.96 g NaHC0 3 to 1000 ml water. 

25 

Standard and sample buffer: 

To 100 ml of 150 mM phosphate buffer pH 7.2 was added: 

50 fil Tween 20 

2 g Bovine Serum Albumin (Sigma A 7030). 

30 

Washing buffer: 

10 mM sodium phosphate pH 7.4 
145 mM sodium chloride 
0.1 % Tween 20. 

35 

Citric acid-phosphate buffer, pH 4.9: 

The following reagents were added to 1000 ml of water 
7.3 g citric acid 
23.88 g Na 2 HP0 4 , 12 H 2 0 
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0.5 ml Tween 20 

The buffer was used for a maximum of 14 days, stored at 4°C. 

Urea peroxide solution: 
5 125 mg urea peroxide was dissolved in 8.93 ml water. 

The solution was kept in the dark at 4°C. 

Coating of the plates for assay: 

The 95 well plate was coated with 175 ft of the DAKO A033 
10 antibody diluted 1:1000 in the coating buffer. The plate was incubated over 
night at 4°C. Before use the plate was washed 4 times in washing buffer. 

Application of standards and samples: 

100 ft standard or sample was added to each well. As a standard 

15 purified human a 2 M, 2 mg/ml (prepared as described in: Sottrup-Jensen et al., 
(1983) Ann. N.Y. Acad. Sci . 421: 41-60) was used. The standard curve included 
the following serial dilutions: 1:4000, 1:8000, 1:16000 etc. down to 
1:1024000, corresponding to final concentrations from 500 pg/1 down to 1.95 
/jg/1 . All dilutions were done in the Standard and sample buffer. The plate 

20 was incubated over night at 4°C and then washed 4 times with wash buffer 
before the next step. 

Addition of conjugated antibody: 

100 ft of PE326, which had been diluted 1:6000 in the Standard 
25 and sample buffer, was added to each well. The plate was incubated for 2 h 
at 20°C, and then washed 4 times with wash buffer. 

Enzyme activation: 

8 mg of 0PD was dissolved in 12 ml of Citric acid- phosphate 
30 buffer. To this solution 500 ft Urea peroxide solution was added and the 
mixture was used immediately. 100 ft of the final solution was added to each 
well, and the plate was incubated in the dark for 6 min. Then 100 ft of 2 M 
H 2 S0 4 was added to each well and the A^ was read in an automated ELISA plate 
reader. 

35 

The above described ELISA did not give any background on medium 
supplemented with 10% FCS, nor did it give any background in BHK cell 
conditioned medium. Of 24 isolated MTX resistant clones, 16 produced 
detectable amounts of recombinant a^. 
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Selected cell lines that secreted 12.3 mg/1 (K16-6) and 19.1 
mg/1 (K17-6) in the supernatant (grown in a 6 well NUNC-plate) over a 48 
hour period were expanded for large scale production of recombinant human 
(rc^M). 

5 

Purification of recombinant human ajft. 

Cell lines K16-6 and K17-6 were each expanded into one ten- 
double tray (NUNC, Denmark) with a growth surface of 6000 cm 2 . At 80% 
confluency the medium on the cells was changed from containing the 10% fetal 
10 calf serum (FCS) down to 2%. After 48 hours of growth in medium with only 2% 
(FCS), the medium was removed, and the cells were washed twice with serum 
free medium. Cells were then grown serum free for 4 to 5 days with change of 
serum free medium every two days. Conditioned medium was pooled and analyzed 
for ra 2 M by ELISA. 

15 The pooled conditioned medium from K16-6 and from K17-6 contained 

7.15 mg/1 and 21.5 mg/1 of ra 2 M, respectively. 

The rc^M was purified according to published procedures (Sottrup- 
Jensen et al . , (1983) Ann. N. Y. Acad. Sci. 421: 41-60). Briefly the 
conditioned medium was loaded onto a 10 ml Zn-Chelate column (Zn 2+ - 

20 iminodiacetic acid Sepharose 4B (Porath, J. et al . , (1975) Nature 258: 598- 
599) equilibrated with 25 mM Tris-HCl pH 8.0, and washed with 100 ml 
phosphate buffered saline (PBS) pH 7.2 until A^ < 0.036. A second wash with 
20 mM sodium phosphate, 500 mM NaCl pH 6.2 was performed until A^ < 0.033. 
The flow rate was 100 ml/hr and 3 ml fractions were collected. m 2 M was eluted 

25with 100 mM EDTA pH 7.0 at a flow rate of 40 ml/hr. During elution 1 ml 
fractions were collected. 

Recovery of rct^ was 44%. The ra 2 M containing fractions were con- 
centrated to 1 ml on an Amicon devise equipped with a PM 10 membrane and 
then loaded onto a Superose 12 gelfiltration column (25 mM Tris-HCl, 150 mM 

30 NaCl pH 8.0). The ra^ containing fractions were pooled and stored at -20°C 
until analysis. 

EXAMPLE 3. 

Characterization of recombinant human rotoM. 

35 

A. Chemical reactions at the thiol ester: thermal fragmentation and 

methvlamine induced cleavage. 
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A number of different analyses were performed to evaluate the 
structural and biological characteristics of the human rajft as compared to 
a preparation of human plasma derived a^, designated preparation LSJ39. 

An important structural feature of a 2 M is the presence of the 
5 thiol ester. When heated to 95 e C for 15 min, the thiol ester will induce a 
peptide bond cleavage in the backbone of ctjft at the position of the thiol 
esterified Glx-residue. This results in the fragmentation of the 180 kD ctj* 
monomer into two polypeptides of 120 kD and 60 kD. Fig. 3 shows an analysis 
of both the purified rajft (from two transformed BHK cell lines) and the 

10 purified human plasma derived preparation LSJ39 on a 10-20% SDS polyacryl- 
amide gel. The different preparations, either native human or BHK cell 
derived recombinant a 2 M were all heat treated to induce thermal fragmenta- 
tion before loading onto the gel. Molecular weight markers (from top to 
bottom: 180, 120, 92, 60, 43, 26, 14 and 6 kD) were applied to lanes 1 and 

15 8. Samples in lanes 2, 3 and 4 were not reduced before electrophoresis, while 
samples in lanes 5, 6 and 7 were reduced. Preparation LSJ39 was applied to 
lanes 2 and 5. ra 2 M K16-6 was applied to lanes 3 and 6, and ra 2 H K17-6 was 
applied to lanes 4 and 7. 

It was clear from the patterns of protein fragments on the gel, 

20 that both human or 2 M and the two ra 2 M preparations showed a considerable degree 
of thermal fragmentation. As expected, only the reduced samples displayed 
this fragmentation. In the nonreduced samples, the molecules migrated as the 
360 kD dimer. 

In the human plasma derived preparation LSJ39 (lane 5) a fragment 
25 migrating slightly faster than the 60 kD fragment could be observed. Lanes 
6 and 7 indicated the presence in the recombinant material of a similar 
faster migrating fragment. It is possible that this fragment represented a 
slightly underglycosylated variant of the 60 kD fragment. 

Methyl amine (MA) and other small nitrogen containing nucleo- 
30 philes will cleave the thiol ester and thereby inactivate the ester (Sottrup- 
Jensen, L., et al., (1980) FEBS Lett. 121: 275-280; Salvesen, G.S. et al., 
(1981) Biochem. J. 195: 453-461). After MA induced inactivation of the thiol 
ester, thermal fragmentation of a 2 M can no longer be observed. 

Fig. 4 shows a SDS-PAGE run similar to that shown in Fig. 3 (with 
35 respect to loaded samples), in which applied a 2 M and ra 2 M had been pretreated 
with MA. From this gel it was concluded, that the thiol ester of ra 2 M was just 
as susceptible to cleavage with MA as the thiol ester of native a 2 M. Upon 
reduction MA-treated a 2 M and ra 2 M migrated as a single 180 kD monomer species. 
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Lanes 5 of both Fig. 3 and 4 shoved an additional band of 
approximately 85 kD. When a 2 M is cleaved in the bait region by proteinases 
present in the blood, it generates two fragments, each with a molecular 
weight of 85 kD. The human a 2 H preparation LSJ39 (purified from serum) 
5 contained these cleavage products, while they could not be detected on this 
gel in the two rajft preparations. This indicated that the material secreted 
from the transformed BHK cell lines was largely native uncomplexed a 2 M. Any 
ctjft molecules, that have reacted with proteinases are inactivated and can 
not form additional complexes with other proteinases. Since the BHK cell 
10 does not produce any proteinases that forms complexes with the ra 2 M product, 
this cell is therefore well suited for production of recombinant human ar 2 M. 

B. Reaction with trypsin. 

Reaction with trypsin is a standard way of analyzing the proteinase-complex 

15formation ability of (Sottrup-Oensen, L. (1987) in: "The Plasma Proteins" 
(Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Harpel , 
P.C. (1973) J. Exp. Med. 138: 508-521; Harpel, P.C., et al . , (1979) J. Biol. 
Chem. 254: 8869-8878; Swenson, R.P. and Howard, J.B. (1979) J. Biol. Chem. 
254 : 4452-4456). In this reaction trypsin will cleave at its target site(s) 

20 in the bait region of a z M, and the resulting reduced cleavage products (85 kD) 
will migrate as a double band. Under nonreducing conditions the trypsin-a^ 
complexes will migrate as high molecular weight products. 

Fig. 5 shows the result of such an analysis (performed as 
described (Sottrup-Jensen, L. (1987) in: "The Plasma Proteins" (Putnam, F.W., 

25ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Harpel, P.C. (1973) 
J. Exp. Med. 138: 508-521; Harpel, P.C. , et al . , (1979) J. Biol. Chem. 254: 
8869-8878; Swenson, R.P. and Howard, J.B. (1979) J. Biol. Chem. 254: 4452- 
4456)) on the native human ajfi preparation LSJ39 (lanes 2 and 5) and on ror 2 M 
from cell lines K16-6 (lanes 3 and 6) and K17-6 (lanes 4 and 7). The samples 

30 in lanes 2, 3 and 4 were not reduced before electrophoresis, while the 
samples in lanes 5, 6 and 7 were. Lane 5 shows that almost all of the human 
native a 2 M was cleaved with trypsin, while the two preparations of ra^ were 
cleaved with an efficiency of approximately 80% or more. Without reduction 
of the complexes no low molecular weight products from the reaction between 

35 trypsin and the native or the BHK cell derived ra 2 M were seen on the gel. 
The 85 kD fragments derived from the recombinant material migrated somewhat 
faster than the human standard; as mentioned above the recombinant materi- 
al might be slightly underglycosylated. 
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When a 2 M is reacted with methylamine, the thiol ester will be 
inactivated, and a 2 M changes conformation from the "slow" form to the "fast" 
form (Sottrup-Jensen, L. (1987) in: The Plasma Proteins (Putnam, F.W., ed.) 
2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Van Leuven, F., Cassiman, 
5J.-J. and Van Den Berghe, H. (1981) J. Biol. Chem. 255: 9016-9022). In this 
conformation it can no longer react rapidly with or form complexes with 
proteinases such as e.g. trypsin. 

Fig. 6 shows the results of a set of experiments that were run 
in parallel to the experiments described above and shown in Fig. 5. However, 
10 before reaction with trypsin the native human a 2 M and the ra 2 M used in this 
experiment had been treated with methylamine (Sottrup-Jensen, L., et al., 
(1980) FEBS Lett. 121: 275-280). Under these conditions both the native a 2 M 
and the ra 2 M show a marked decrease in reactivity towards trypsin (80% or 
more of the a z M and ra 2 M monomers were migrating as a 180 kD polypeptide). 
15 This indicates that trypsin does not rapidly cleave at the bait region in 
methylamine treated human a 2 M or in BHK cell derived ra 2 M. 

In these types of experiments BHK cell derived rcr 2 M has shown 
characteristics similar to those of native human a 2 M. 

20 C. Trypsin and methylamine induced conformational change in g ? M. 

As mentioned above the or 2 M molecule will undergo a conformational 
change both through complex formation with proteinases and through methyl- 
amine induced cleavage of the thiol ester. The change in structure results 
in an altered mobility on rate gels (Sottrup-Jensen, L. (1987) in: The Plasma 

25 Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, 
FL; Van Leuven, F., Cassiman, J. -J. and Van Den Berghe, H. (1981) J. Biol. 
Chem. 256: 9016-9022); unreacted <> 2 M will migrate as a "slow" form, while 
reacted a^l will migrate as a "fast" form. 

Fig. 7 and Fig. 8 show these conformational changes, as they 

30 appear after reaction with trypsin and methylamine, respectively (analyzed 
on 5-10% rate gels). 

Lanes 1 on both gels contain purified human pregnancy zone 
protein (PZP) (Sand, 0. et al . , (1985) J. Biol. Chem. 260: 15723-15735), 
which is known to appear in both a dimeric (D) and a tetrameric (T) 
35 configuration. 

Lanes 2 on both gels contain unreacted human a 2 M preparation 
LSJ39. Lanes 3 on both gels show the fast migrating form, resulting from 
reaction with trypsin and methylamine, respectively. Lanes 4 on both gels 
show the unreacted ror^ preparation K16-6, and lanes 5 show the corresponding 
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fast forms. Lanes 6 on both gels show the unreacted ror 2 M preparation K17- 
6, and lanes 7 show the corresponding fast forms. 

It can be concluded that both complex formation between rotjfl and 
trypsin and reaction of ra£l with methyl amine result in the appearance of 
5 fast migrating structures. These structures appear (as analyzed on rate gels) 
to be very similar to the structures obtained when human a 2 M was allowed to 
react with trypsin and methylamine. It is also evident from these figures 
that the ra 2 M proteins showed a migration, which, when compared to the 
migration of dimeric and tetrameric PZP on the gels, is in agreement with the 
10 finding that these molecules are produced and secreted from the BHK cells in 
the active tetrameric conformation. 

D. Chromatography of o^M on a Superose 6 column. 

A Superose 6 column can partially resolve a 2 M molecules in the 
15 dimeric configuration from molecules in the tetrameric configuration 
(Sottrup-Jensen, L. unpublished). Human standard o^M and rajfi was analyzed 
on a 24 ml Superose 6 column (buffer: 25 mM Tris-HCl, 125 mM NaCl pH 8.0; 
flow rate: 1 ml/min; fraction size: 1 ml). Fig. 9 shows the diagrams obtained 
from the chromatography of purified human standard cr 2 M and ra^ from the K17- 
20 6 and the K16-6 BHK cell lines. Tetrameric a 2 M (Sottrup-Jensen, unpublished 
observation) will elute in fraction 12 on this type of column. It is evident 
from the chromatograms that both of the ra 2 M preparations eluted in fraction 
12, as did the human standard or 2 M. On this type of column, dimeric a^i 
molecules will elute in fraction 14 and 15 (Sottrup-Jensen, unpublished 
25 observation). This type of analysis supported the results obtained from the 
rate gels (Figs. 7 and 8), that rctjt was secreted from BHK cells in a 
tetrameric configuration. 

E. Trypsin protection analysis. 

30 When trypsin is trapped inside the or 2 M molecule, it retains its 

catalytic capacity towards low molecular weight substrates such as S-2222 
(N-benzoyl-L-Ile-L-Glu-Gly-L-Arg-p-nitroanilide), If trypsin is efficiently 
complexed with ajM, it will be protected against high molecular weight 
inhibitors such as Soybean Trypsin Inhibitor (STI) (Sottrup-Jensen, L. (1987) 

35 in: The Plasma Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic 
Press, Orlando, FL; Ganrot, P.O. (1966) Clin. Chim. Acta. 14: 493-501; 
Sottrup-Jensen, L. et al . , (1981) FEBS Lett. 128: 127-132). 

K16-6 and K17-6 derived ra 2 M was compared with human plasma ajft 
in such a protection assay. 100 /il a 2 M (in 25 mM Tris-HCl, 125 mM NaCl, pH 
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8.0) was mixed with 30 /il trypsin (0.5 mg/ml in 20 mM sodium acetate pH 5.0). 
After incubating for 2 min. 30 /il 1 mg/ml STI (in PBS) was added. 10 /il ali- 
quots were removed after 2 and 4 min. and each mixed with 750 /il 0.12 mM S- 
2222 (dissolved 0.1 M sodiumphosphate pH 8.0, 5% dimethylsulfoxide) . 
5 The change in absorbance at 405 nm was recorded for 2 min. The 

results of the assay are given in the following Table II: 

TABLE II 

10 Prep, of or 2 M. a 2 M in cuvette. Activity. 

A^min fig A^min/zxg 

Human LSJ39 0.140 5.00 0.028 

15 

K16-6 0.111 4.62 0.024 

K17-6 0.119 4.87 0.024 

20 

From these results it can be concluded that ra 2 M had essential- 
ly the same protection capacity for trypsin against STI as compared with the 
protection capacity of human plasma a 2 M. 

If a 2 M is treated with methylamine before the protection assay, 
25 the protection capacity drops dramatically. In a similar assay as that 
described above, methylamine treated human plasma only retained 17% of 
its protection capacity, while K16-6 and K17-6 ra 2 M retained 16% and 14% 
respectively. It can be concluded that ra 2 M protected trypsin against STI 
with almost the same efficiency as did human plasma a^M. 

30 

E. Amino terminal amino acid sequencing of ra^M. 

Theoretically, the or 2 M characterized in the present investiga- 
tion could only be either bovine (contaminant from serum), from hamster 
(endogenous product from the BHK cell) or derived from expression of the 

35 transfected plasmid pll67. The ELISA assay used never recognized any a 2 M in 
BHK cell conditioned medium, whether with or without added fetal calf serum. 
To make sure that the investigated a 2 M was human a 2 M, and to characterize the 
amino terminal processing of the recombinant product, amino terminal amino 
acid sequence determination was carried on out K16-6 and K17-6 ra^ as 

40 described (Sottrup-Jensen, L. et al . , (1984) J. Biol. Chem. 259: 8293-8303). 
The Edman degradation was repeated for 12 cycles, and the identity of the 
detected amino acid derivative in each cycle, was in total agreement with the 
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amino terminal sequence of human ajtt: Ser-Val -Ser-Gly-Lys-Pro-Gln-Tyr-Met- 
Val-Leu-Val-, whereas bovine has the following amino terminal sequence: 
Ala-Val-Asp-Gly-Lys-Pro-Gln-Tyr-Met-Val-Leu-Val- (unpublished, Dr. Torsten 
Kristensen, Department of Molecular Biology, University of Aarhus, Denmark.) 

5 

EXAMPLE 4. 

Construction and expression of a bait region mutant of human oiM. 

In the present example it is demonstrated that the bait region 

of human c^M can be substituted by the bait region of human pregnancy zone 
lOprotein (PZP) (Sottrup Jensen, L., Folkersen, J., Kristensen, T. and Tack, 

B.F. Partial primary structure of human pregnancy zone protein: extensive 

sequence homology with human alpha 2-macroglobul in. Proc. Natl. Acad. Sci . 

U.S.A. 81. 7353-7357, 1984; Sand, 0., Folkersen, J., Westergaard, J.G. and 

Sottrup Jensen, L. Characterization of human pregnancy zone protein. 
15Comparison with human alpha 2-macroglobulin. J.Biol .Chem. 260, 15723-15735, 

1985). The resulting c^M bait region mutant exhibited a proteinase inhibitor 

profile similar to that of human pregnancy zone protein. 

To facilitate substitution of DNA fragments encoding the bait 

region of human ajfi cDNA, target sites for the restriction enzymes PstI and 
20SacII were introduced at the 5' and at the 3' end of the cDNA region encoding 

the bait region. 

The human a 2 M expression plasmid pi 167 was digested with BamHI and 
Clal, and a 2660 bp fragment, which carried the central part of the human 
a 2 M cDNA, was subcloned in the BamHI and Clal digested vector pSX191. 
25 This vector, which had previously been constructed, is a 

derivative of pUC19. It was constructed as described: pUC19 was digested 
with EcoRI and HindlH, and a synthetic linker with the following sequence 

Kpn l Pst I EcoRI Hind3 Cla l Sph I BamHI 
30 AATTGGTACCCTGCAGGAATTCAAGCTTATCGATGGCATGCGGATCC - N0R781 
CCATGGGACGTCCTTAAGTTCGAATAGCTACCGTACGCCTAGGTCGA - N0R782 

was cloned in the digested pUC19 vector. The linker, which was an annealing 
product from the two synthetic oligonucleotides N0R781 and N0R782, has 
35 cohesive ends that will ligate to the EcoRI and the Hindi II sites of pUC19 
in such a way that these ligation sites are not regenerated in the pSX191 
vector. Thus pSX191 carried sites for Kuril, PstI, EcoRI. Hindlll, Clal, Sfihl 
and BamHI. 

The resulting plasmid pSX191or 2 M was digested with BamHI and 
40 Hindi 1 1 , and a purified 2.6 kb BamHI/Hindlll cr 2 M fragment was cloned in 
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M13mpl8 to generate M13mpl8cr 2 M for mutagenesis by described methods. A 
synthetic oligonucleotide N0R973, with the following sequence: 

5' ( TTCATACTGCTGCAGCTGTGGACAC ) 3 ' 
was used to introduce a PstI site at position 2102 (SEQ ID NO: 1) in the cDNA 
5 sequence, and a oligonucleotide (NOR974) with the following sequence: 

5' (AGCCACCC CCGCGG AGTTTACCAC)3 / 
was used to introduce a SacII site at position 2271 (SEQ ID N0:1) in the 
cDNA sequence. These sites were chosen because they did not introduce 
alterations in the encoded amino acid sequence, and they were within a 

10 convenient distance of the bait region in human or 2 M cDNA. Both primers were 
used in the same mutagenesis experiment (Kunkel, T.A., Roberts, J.D. and 
Zakour, R.A. Rapid and Efficient Site-Specific Mutagenesis without Phenotypic 
Selection. Methods in Enzvmol . 154 , 367-382, 1987); dsDNA was isolated from 
mutated M13mpl8a 2 M plaques, and the DNA was digested with the restriction 

15 enzymes PstI and SacII. Correctly mutated recombinants, which had an insert 
of 160 bp, were further analyzed by DNA sequencing (Tabor, S. and Richardson, 
C.C. DNA sequence analysis with a modified bacteriophage T7 DNA polymerase. 
Proc. Natl. Acad, Sci . U.S.A. 84, 4767-4771, 1987). A 2,6 kb BamHI/Hindlll 
fragment from a correct or^ cDNA mutant (M13mpl8a 2 M#212.1) was subcloned in 

20a BamHI/Hindlll digested pUC13 vector, and a correct subclone pl308 was 
isolated and characterized with BamHI/Hindlll and Pstl/SacII double 
digestions and DNA electrophoresis. 

The Pstl/SacII fragment in pl308 can be excised and replaced 
with a different DNA fragment, which encodes bait region variants. The 

25 resulting new variants (bait region mutants or analogs) of a 2 M cDNA can be 
isolated as BamHI/Clal fragments and subcloned back into BamHI/Clal digested 
expression vector pi 167 . 

In the present example DNA encoding the amino acids of the bait 
region for human PZP (Sottrup-Jensen et al . 1989, supra ) was obtained from 

30 ligation, annealing and cloning of 8 synthetic oligonucleotide.,. 

The DNA sequence of the synthetic fragment and the encoded amino 
acids as inserted into the a 2 M clone are given in SEQ ID N0:3, and comprises 
positions 2107 to 2305 and the corresponding amino acids. A PstI site was 
introduced at the 5' end in the synthetic fragment, and SacII and BamHI sites 

35 were introduced at the 3' end. 

This synthetic 0.2 kb DNA fragment was cloned in a Pstl/BamHI 
digested M13mpl8 vector for DNA sequencing. DNA from a clone containing the 
correct sequence was digested with PstI and SacII, and the purified 0.2 kb 
fragment was cloned in a PstI /Sac 1 1 digested and gel purified pl308 vector. 
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A correct recombinant, p267PZP, was characterized with restriction enzyme 
digestions, and from this plasmid, bait region mutated (ajft - PZP) cDNA was 
isolated as a 2.7 kb BamHI/Clal fragment and subcloned in a BamHI/Clal 
digested expression vector pll67. The resulting plasmid, designated pl365, 
5was grown as a large scale plasmid preparation, purified by CsCl centrifuga- 
tion, and cotransfected with pDHFR-I into BHK cells. 

Through this procedure the nucleotides 2102 to 2275 in SEQ ID 
N0:1 was removed and replaced with nucleotides 2102 to 2305 in SEQ ID NO:3. 

The procedures for transfection, selection of bait region mutated 
10a 2 M (designated ra^-PZP) recombinants (with an a 2 M specific ELISA), large 
scale production and purification of mutated ajft were as described elsewhere 
(EXAMPLE 2) in this application. 

Characterization of the proteinase inhibitor specificity of a bait region 

15 mutant of human or,M. 

The purified recombinant a 2 M mutant, ra 2 M-PZP, was characterized 
with respect to its inhibitor specificity profile against various proteina- 
ses by the use of previously described methods (Sand et al.1985). For 
comparison human plasma derived a 2 M and PZP were treated with the same set 

20 of proteinases in parallel reactions. The proteinases used were chymotryp- 
sin, elastase, trypsin and Staphylococcus aureus Glu-specific proteinase. 
It has been reported (Sand et al.1985) that chymotrypsin and elastase show 
a rapid reaction with both PZP and o^, while the reaction between the two 
proteinase inhibitors and trypsin and Staphylococcus aureus Glu-specific 

25 proteinase is quite dissimilar for PZP and a-M: both proteinases react rapidly 
with q 2 M, while the reaction with PZP is slow (Sand et al.1985). The reason 
for this difference in reaction rate with the different proteinases is 
believed to be due to the fact that the bait region in PZP contains strong 
specificity determinant for chymotrypsin and elastase, but none for trypsin 

30 and Staphylococcus aureus Glu-specific proteinase. 

The results of the analysis is presented in figures 10 to 13. 
Figure 10 illustrates the gel electrophoresis (10 - 20 % reducing 
SDS-PAGE) of the reaction products from chymotrypsin treated human a 2 M, human 
PZP and ra 2 M-PZP. Molecular weight markers (from top to bottom: 180, 120, 92, 

35 60, 43, 25, 14 and 6 kD) were applied to lanes 1 and 8. All samples were 
reduced. Lanes 2, 3 and 4 show the cleavage products obtained from reaction 
of chymotrypsin with human plasma derived PZP, ror 2 M-PZP and human plasma 
derived ctjM, respectively. The ratio of proteinase to inhibitor was 1:1. Lanes 
5, 5 and 7 show cleavage products from similar reactions at a ratio of 2:1 
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between proteinase and the three tested inhibitors. In all 6 lanes cleavage 
products (85 kD) could be identified. This indicated that ra 2 M-PZP reacted 
with chymo trypsin with similar characteristics as did human plasma derived 
a 2 M and PZP. 

5 Figure 11 illustrates the gel electrophoresis (10 - 20 % reducing 

SDS-PAGE) of the reaction products from elastase treated human ajft, human 
PZP and rctgM-PZP. Molecular weight markers were the same as applied on the 
gel in Fig. 2. All samples were reduced. Lanes 2, 3 and 4 show the cleavage 
products obtained from reaction of elastase with human plasma derived PZP, 
10ror 2 M-PZP and human plasma derived cr 2 M, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In all 6 lanes cleavage products (85 kD) could be identified. 
This indicated that ra^-PZP reacted with elastase with similar characteris- 
tics as did human plasma derived a 2 M and PZP. 

Figure 12 illustrates the gel electrophoresis (10 - 20 % reducing 
SDS-PAGE) of the reaction products from trypsin treated human a 2 M, human PZP 
and ra^M-PZP. Molecular weight markers were the same as applied on the gel 
in Fig. 2. All samples were reduced. Lanes 2, 3 and 4 show the cleavage 
20 products obtained from reaction of trypsin with human plasma derived PZP, 
human plasma derived a 2 M and ra 2 M-PZP, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In lanes 3 and 6 cleavage products (85 kD) could be identified 
25 from the reaction between trypsin and a 2 M. In lanes 2, 4, 5 and 7 no cleavage 
products were observed from the reaction of trypsin with PZP and ra 2 M-PZP. 
This result demonstrated that ror 2 M-PZP reacted poorly with trypsin as did 
human plasma derived PZP, while or 2 M was cleaved in the reaction with trypsin. 

Figure 13 illustrates the gel electrophoresis (10 - 20 % reducing 
30 SDS-PAGE) of the reaction products from Staphylococcus aureus Glu-specific 
protease treated human a 2 M, human PZP and ra 2 M-PZP. Molecular weight markers 
were the same as applied on the gel in Fig. 2. All samples were reduced. 
Lanes 2, 3 and 4 show the cleavage products obtained from reaction of 
Staphylococcus aureus Glu-specific protease with human plasma derived PZP, 
35ror 2 M-PZP and human plasma derived a 2 M, respectively. The ratio of proteinase 
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar 
reactions at a ratio of 2:1 between proteinase and the three tested 
inhibitors. In lanes 4 and 7 cleavage products (85 kD) could be identified 
from the reaction between Staphylococcus aureus Glu-specific protease and 
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a 2 M. In lanes 2, 3, 5 and 6 much less cleavage product could be identified 
from the reaction of this proteinase with .PZP and ro^-PZP. This result 
demonstrated that ra 2 M-PZP reacted poorly with the Staphylococcus aureus 
proteinase as did human plasma derived PZP, while a 2 M was cleaved in the 

5 reaction with this proteinase. 

It can be concluded that niaM-PZP showed the same pattern of 
reaction with four proteinases as did human plasma derived PZP. This pattern 
of reaction was different from the corresponding pattern obtained from 
reaction with o$l. Thus ra 2 M-PZP has been demonstrated to have a proteinase 

10 inhibitor profile similar to native PZP and dissimilar to aJA. Thus it has 
been demonstrated that the proteinase inhibitor profile of can be 
modulated by substitution of DNA fragments encoding the bait region. 

The substitution as described in this invention did not destroy 
the activity of the proteinase inhibitor, and it is therefore demonstrated 

15 that functional macroglobul in hybrids can be constructed by substitutions 
(mutations) in the bait region. The finding will lead to the design of a 2 M- 
derivatives with new desired proteinase specificities. No doubt, these 
results could be extended to other macroglobul in based hybrids, in which the 
bait region can be modified at will to obtain new inhibitor specificities. 

20 Aggressive activity of proteinases is often a problem in relation 

to various diseases (e.g. the activity of elastase and cathepsin G in severe 
inflammation leads to tissue and organ destruction and failure). Inhibitors 
of such proteinases will be useful in drug design. In situations where the 
target site for the proteinase is known, but no inhibitor can be identified, 

25a 2 M can be engineered (mutated in the bait region) to obtain the desired 
specificity. In a situation where the target specificity of the proteinase 
in question is unknown, saturation mutagenesis or random synthesis of the 
bait region will lead to an indefinite number of target sequences that can 
be introduced and expressed in hybrid macroglobul ins. These hybrids can be 

30 screened for proteinase inhibition, and the target sequence(s) can be 
identified. The resulting a 2 M analog can be produced and purified as described 
elsewhere in this invention. Upon injection into the circulation such or 2 M 
analogs will inhibit and clear from the blood any proteinase of the given 
specificity. 

35 Introduction of protein analogs or mutants in the human body 

always raises the possibility for antigenicity. The generation of- a panel 
of 45 mouse monoclonal antibodies against human q 2 M has been described (Van 
Leuven et al.1988; Delain et al.1988). None of these antibodies were directed 
against the bait region. This indicates that the bait region is not highly 
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antigenic and that mutants in this region of the molecule can be generated 
and used for therapeutical uses without risk for antibody development. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Novo Nordisk A/S 

(ii) TITLE OF INVENTION: Expression of Plasma Glycoproteins 
(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Novo Nordisk A/S, Patent Department 

(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 

(E) COUNTRY: DENMARK 

(F) ZIP: DK-2880 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 4235/89, DK 4236/89, DK 4237/89 

(B) FILING DATE: 29-AUG-1989 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: Hepatic 

(G) CELL TYPE: Hepatoblastoma 

(H) CELL LINE: HepG2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 29.. 4450 
(D) OTHER INFORMATION: 



(xi ) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GTCTCCTCCA GCTCCTTCTT TCTGCAAC ATG GGG AAG AAC AAA CTC CTT CAT 52 

Met Gly Lys Asn Lys Leu Leu His 
1 5 

CCA AGT CTG GTT CTT CTC CTC TTG GTC CTC CTG CCC ACA GAC GCC TCA 100 
Pro Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro Thr Asp Ala Ser 
10 15 20 
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GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC TCC CTG CTC CAC 148 
Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu His 
25 30 35 40 

ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC TAC CTG AAT GAG 196 
Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn Glu 

45 50 55 

ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG GGA AAC AGG AGC 244 
Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg Ser 

60 65 70 

CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC CAC TGT GTC GCC 292 
Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val Ala 
75 80 85 

TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA ATG TTC CTC ACT 340 
Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu Thr 
90 95 100 

GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG CGG ACC ACA GTG 388 
Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr Val 
105 110 115 120 

ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG ACA GAC AAA TCA 436 
Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys Ser 

125 130 135 

ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT GTC TCC ATG GAT 484 
He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met Asp 

140 145 150 

GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA GTA TAC ATT CAG 532 
Glu Asn Phe His Pro Leu Asn Glu Leu He Pro Leu Val Tyr He Gin 
155 160 165 

GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT TTC CAG TTA GAG 580 
Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser Phe Gin Leu Glu 
170 175 180 

GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA GAG CCC TTC CAG 628 
Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe Gin 
185 190 195 200 

GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT GGA AGG ACA GAG 676 
Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr Glu 

205 210 215 

CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG TTT GAA GTA CAA 724 
His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val Gin 

220 225 230 

GTA ACA GTG CCA AAG ATA ATC ACC ATC TTG GAA GAA GAG ATG AAT GTA 772 
Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn Val 
235 240 245 
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TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC CCT GGA CAT GTG 
Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His Val 
250 255 260 

ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC GAC TGC CAC GGT 
Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His Gly 
265 270 275 280 

GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA CAG CTA AAC AGC 
Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn Ser 

285 290 295 

CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC TTC CAG CTG AAG 
His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu Lys 

300 305 310 

AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC CAG ATC CAA GAA 
Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin Glu 
315 320 325 

GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC AGT GAA ATC ACA 
Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He Thr 
330 335 340 

AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC TCA CAC TTT CGA 
Arg Thr lie Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe Arg 
345 350 355 350 

CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA GAT GGG AAA GGC 
Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys Gly 

365 370 375 

GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA AAT GAA GCA AAC 
Val Pro He Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala Asn 

380 385 390 

TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT GTA CAG TTC TCT 
Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe Ser 
395 400 405 

ATC AAC ACC ACC AAT GTT ATG GGT ACC TCT CTT ACT GTT AGG GTC AAT 
He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val Asn 
410 415 420 

TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG GTG TCA GAA GAA 
Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu Glu 
425 430 435 440 

CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC TCC CCA AGC AAG 
His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser Lys 

445 450 455 

AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA CCC TGT GGC CAT 
Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly His 

460 465 470 
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ACT CAG ACA GTC CAG GCA CAT TAT ATT CTG AAT GGA GGC ACC CTG CTG 1492 
Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly Gly Thr Leu Leu 
475 480 485 

GGG CTG AAG AAG CTC TCC TTC TAT TAT CTG ATA ATG GCA AAG GGA GGC 1540 
Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly Gly 
490 495 500 

ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG CAG GAA GAt ATG 1588 
He Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gin Glu Asp Met 
505 510 515 520 

AAG GGC CAT TTT TCC ATC TCA ATC CCT GTG AAG TCA GAC ATT GCT CCT 1636 
Lys Gly His Phe Ser lie Ser He Pro Val Lys Ser Asp He Ala Pro 

525 530 535 

GTC GCT CGG TTG CTC ATC TAT GCT GTT TTA CCT ACC GGG GAC GTG ATT 1684 
Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val He 

540 545 550 

GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG GCC AAC AAG GTG 1732 
Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys Val 
555 560 565 

GAT TTG AGC TTC AGC CCA TCA CAA AGT CTC CCA GCC TCA CAC GCC CAC 1780 
Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala His 
570 575 580 

CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC CTC CGT GCT GTG 1828 
Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala Val 
585 590 595 600 

GAC CAA AGC GTG CTG CTC ATG AAG CCT GAT GCT GAG CTC TCG GCG TCC 1876 
Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala Ser 

605 610 615 

TCG GTT TAC A": CTG CTA CCA GAA AAG GAC CTC ACT GGC TTC CCT GGG 1924 
Ser Val Tyr A^,i Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro Gly 

620 625 630 

CCT TTG AAT GAC CAG GAC GAT GAA GAC TGC ATC AAT CGT CAT AAT GTC 1972 
Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys He Asn Arg His Asn Val 
635 640 645 

TAT ATT AAT GGA ATC ACA TAT ACT CCA GTA TCA AGT ACA AAT GAA AAG 2020 
Tyr lie Asn Gly lie Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu Lys 
650 655 660 

GAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG GCA TTC ACC AAC 2068 
Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr Asn 
665 670 675 680 

TCA AAG ATT CGT AAA CCC AAA ATG TGT CCA CAG CTT CAA CAG TAT GAA 2116 
Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu Gin Gin Tyr Glu 

685 690 695 
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ATG CAT GGA CCT GAA GGT CTA CGT GTA GGT TTT TAT GAG TCA GAT GTA 
Met His Gly Pro Glu Gly Leu Arg Val Gly Phe Tyr Glu Ser Asp Val 

700 705 710 

ATG GGA AGA GGC CAT GCA CGC CTG GTG CAT GTT GAA GAG CCT CAC ACG 
Met Gly Arg Gly His Ala Arg Leu Val His Val Glu Glu Pro His Thr 
715 720 725 

GAG ACC GTA CGA AAG TAC TTC CCT GAG ACA TGG ATC TGG GAT TTG GTG 
Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp He Trp Asp Leu Val 
730 735 740 

GTG GTA AAC TCA GCA GGT GTG GCT GAG GTA GGA GTA ACA GTC CCT GAC 
Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val Pro Asp 
745 750 755 760 

ACC ATC ACC GAG TGG AAG GCA GGG GCC TTC TGC CTG TCT GAA GAT GCT 
Thr He Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu Asp Ala 

765 770 775 

GGA CTT GGT ATC TCT TCC ACT GCC TCT CTC CGA GCC TTC CAG CCC TTC 
Gly Leu Gly He Ser Ser Thr Ala Ser Leu Arg Ala Phe Gin Pro Phe 

780 785 790 

TTT GTG GAG CTT ACA ATG CCT TAC TCT GTG ATT CGT GGA GAG GCC TTC 
Phe Val Glu Leu Thr Met Pro Tyr Ser Val lie Arg Gly Glu Ala Phe 
795 800 805 

ACA CTC AAG GCC ACG GTC CTA AAC TAC CTT CCC AAA TGC ATC CGG GTC 
Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys lie Arg Val 
810 815 820 

AGT GTG CAG CTG GAA GCC TCT CCC GCC TTC CTA GCT GTC CCA GTG GAG 
Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro Val Glu 
825 830 835 840 

AAG GAA CAA GCG CCT CAC TGC ATC TGT GCA AAC GGG CGG CAA ACT GTG 
Lys Glu Gin Ala Pro His Cys He Cys Ala Asn Gly Arg Gin Thr Val 

845 850 855 

TCC TGG GCA GTA ACC CCA AAG TCA TTA GGA AAT GTG AAT TTC ACT GTG 
Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val Asn Phe Thr Val 

860 865 870 

AGC GCA GAG GCA CTA GAG TCT CAA GAG CTG TGT GGG ACT GAG GTG CCT 
Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly Thr Glu Val Pro 
875 880 885 

TCA GTT CCT GAA CAC GGA AGG AAA GAC ACA GTC ATC AAG CCT CTG TTG 
Ser Val Pro Glu His Gly Arg Lys Asp Thr Val He Lys Pro Leu Leu 
890 895 900 

GTT GAA CCT GAA GGA CTA GAG AAG GAA ACA ACA TTC AAC TCC CTA CTT 
Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser Leu Leu 
905 910 915 920 



2164 



2212 



2260 



2308 



2356 



2404 



2452 



2500 



2548 



2596 



2644 



2692 



2740 



2788 



WO 91/03557 



PCT/DK90/00225 



39 

TGT CCA TCA GGT GGT GAG GTT TCT GAA GAA TTA TCC CTG AAA CTG CCA 2836 

Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser Leu Lys Leu Pro 

925 930 935 

CCA AAT GTG GTA GAA GAA TCT GCC CGA GCT TCT GTC TCA GTT TTG GGA 2884 

Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val Leu Gly 

940 945 950 

GAC ATA TTA GGC TCT GCC ATG CAA AAC ACA CAA AAT CTT CTC CAG ATG 2932 

Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu Gin Met 

955 960 965 

CCC TAT GGC TGT GGA GAG CAG AAT ATG GTC CTC TTT GCT CCT AAC ATC 2980 

Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro Asn He 

970 975 980 

TAT GTA CTG GAT TAT CTA AAT GAA ACA CAG CAG CTT ACT CCA GAG ATC 3028 

Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro Glu He 
985 990 995 1000 

AAG TCC AAG GCC ATT GGC TAT CTC AAC ACT GGT TAC CAG AGA CAG TTG 3076 

Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr Gin Arg Gin Leu 

1005 1010 1015 

AAC TAC AAA CAC TAT GAT GGC TCC TAC AGC ACC TTT GGG GAG CGA TAT 3124 

Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu Arg Tyr 

1020 1025 1030 

GGC AGG AAC CAG GGC AAC ACC TGG CTC ACA GCC TTT GTT CTG AAG ACT 3172 

Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu Lys Thr 

1035 1040 1045 

TTT GCC CAA GCT CGA GCC TAC ATC TTC ATC GAT GAA GCA CAC ATT ACC 3220 

Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu Ala His He Thr 

1050 1055 1060 

CAA GCC CTC ATA TGG CTC TCC CAG AGG CAG AAG GAC AAT GGC TGT TTC 3268 

Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly Cys Phe 
1065 1070 1075 1080 

AGG AGC TCT GGG TCA CTG CTC AAC AAT GCC ATA AAG GGA GGA GTA GAA 3316 

Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly Val Glu 

1085 1090 1095 

GAT GAA GTG ACC CTC TCC GCC TAT ATC ACC ATC GCC CTT CTG GAG ATT 3364 

Asp Glu Val Thr Leu Ser Ala Tyr He Thr lie Ala Leu Leu Glu He 

1100 1105 1110 

CCT CTC ACA GTC ACT CAC CCT GTT GTC CGC AAT GCC CTG TTT TGC CTG 3412 

Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe Cys Leu 

1115 1120 1125 

GAG TCA GCC TGG AAG ACA GCA CAA GAA GGG GAC CAT GGC AGC CAT GTA 3460 

Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser His Val 

1130 1135 1140 
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TAT ACC AAA GCA CTG CTG GCC TAT GCT TTT GCC CTG GCA GGT AAC CAG 3508 

Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly Asn Gin 
1145 1150 1155- 1160 

GAC AAG AGG AAG GAA GTA CTC AAG TCA CTT AAT GAG GAA GCT GTG AAG 3556 
Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala Val Lys 

1165 1170 1175 

AAA GAC AAC TCT GTC CAT TGG GAG CGC CCT CAG AAA CCC AAG GCA CCA 3604 
Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys Pro Lys Ala Pro 

1180 1185 1190 

GTG GGG CAT TTT TAC GAA CCC CAG GCT CCC TCT GCT GAG GTG GAG ATG 3652 
Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala Glu Val Glu Met 
1195 1200 1205 

ACA TCC TAT GTG CTC CTC GCT TAT CTC ACG GCC CAG CCA GCC CCA ACC 3700 
Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala Pro Thr 
1210 1215 1220 

KG GAG GAC CTG ACC TCT GCA ACC AAC ATC GTG AAG TGG ATC ACG AAG 3748 
Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp He Thr Lys 
1225 1230 1235 1240 

CAG CAG AAT GCC CAG GGC GGT TTC TCC TCC ACC CAG CAC ACA GTG GTG 3796 
Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin His Thr Val Val 

1245 1250 1255 

GCT CTC CAT GCT CTG TCC AAA TAT GGA GCA GCC ACA TTT ACC AGG ACT 3844 
Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr Arg Thr 

1260 1265 1270 

GGG AAG GCT GCA CAG GTG ACT ATC CAG TCT TCA GGG ACA TTT TCC AGC 3892 
Gly Lys Ala Ala Gin Val Thr He Gin Ser Ser Gly Thr Phe Ser Ser 
1275 1280 1285 

AAA TTC CAA GTG GAC AAC AAC AAC CGC CTG TTA CTG CAG CAG GTC TCA 3940 
Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin Val Ser 
1290 1295 1300 

TTG CCA GAG CTG CCT GGG GAA TAC AGC ATG AAA GTG ACA GGA GAA GGA 3988 
Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly Glu Gly 
1305 1310 1315 1320 

TGT GTC TAC CTC CAG ACA TCC TTG AAA TAC AAT ATT CTC CCA GAA AAG 4036 
Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He Leu Pro Glu Lys 

1325 1330 1335 

GAA GAG TTC CCC TTT GCT TTA GGA GTG CAG ACT CTG CCT CAA ACT TGT 4084 
Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu Pro Gin Thr Cys 

1340 1345 1350 

GAT GAA CCC AAA GCC CAC ACC AGC TTC CAA ATC TCC CTA AGT GTC AGT 4132 
Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser Leu Ser Val Ser 
1355 1360 1365 
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TAC ACA GGG AGC CGC TCT GCC TCC AAC ATG GCG ATC GTT GAT GTG AAG 4180 
Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He Val Asp Val Lys 
1370 1375 1380 

ATG GTC TCT GGC TTC ATT CCC CTG AAG CCA ACA GTG AAA ATG CTT GAA 4228 
Met ,'al Ser Gly Phe He Pro Leu Lys Pro Thr Val Lys Met Leu Glu 
1385 1390 1395 1400 

AGA TCT AAC CAT GTG AGC CGG ACA GAA GTC AGC AGC AAC CAT GTC TTG 4276 
Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser Asn His Val Leu 

1405 1410 1415 

ATT TAC CTT GAT AAG GTG TCA AAT CAG ACA CTG AGC TTG TTC TTC ACG 4324 
He Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser Leu Phe Phe Thr 

1420 1425 1430 

GTT CTG CAA GAT GTC CCA GTA AGA GAT CTC AAA CCA GCC ATA GTG AAA 4372 
Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro Ala He Val Lys 
1435 1440 1445 

GTC TAT GAT TAC TAC GAG ACG GAT GAG TTT GCA ATT GCT GAG TAC AAT 4420 
Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala He Ala Glu Tyr Asn 
1450 1455 1450 

GCT CCT TGC AGC AAA GAT CTT GGA AAT GCT TGAAGACCAC AAGGCTGAAA 4470 
Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1465 1470 

AGTGCTTTGC TGGAGTCCTG TTCTCTGAGC TCCACAGAAG ACACGTGTTT TTGTATCTTT 4530 
AAAGACTTGA TGAATAAACA CTTTTTCTGG TCAAAAAAA 4569 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1474 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(E) FEATURES: bait region: 690-730 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 
15 10 15 

Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 

20 25 30 

Val Leu Val Pro Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys 
35 40 45 

Val Leu Leu Ser Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 
50 55 60 

Glu Ser Val Arg Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
65 70 75 80 
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Asn Asp Val Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 

85 90 95 

Asn Glu Glu Val Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin 

100 105 110 

Glu Phe Lys Lys Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 120 125 

Val Phe Val Gin Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val 
130 135 140 

Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 160 

Leu He Pro Leu Val Tyr He Gin Asp Pro Lys Gly Asn Arg He Ala 

165 170 175 

Gin Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 

180 185 190 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys lie He Thr 
225 230 235 240 

lie Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 

245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser lie Cys Arg Lys Tyr 

260 265 270 

Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 
275 280 285 

Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 
290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 

325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 

340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
355 360 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 
370 375 380 
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Phe He Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 
385 390 395 400 

* 

Glu His Gly Leu Val Gin Phe Ser lie Asn Thr Thr Asn Val Met Gly 

405 410 415 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 

420 425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 
435 440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
450 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 
465 470 475 480 

lie Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 

485 490 495 

Tyr Leu He Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 

500 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser lie Ser He 
515 520 525 

Pro Val Lys Ser Asp He Ala Pro Val Ala Arg Leu Leu He Tyr Ala 
530 535 540 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 

565 570 575 

Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 

580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
595 600 605 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 640 

Asp Cys He Asn Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr 

645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 

660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met 
675 680 685 
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Cys Pro Gin Leu Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg 
690 695 700 

Val Gly Phe Tyr Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu 
705 710 715 720 

Val His Val Glu Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro 

725 730 735 

Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala 

740 745 750 

Glu Val Gly Val Thr Val Pro Asp Thr He Thr Glu Trp Lys Ala Gly 
755 760 765 

Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly He Ser Ser Thr Ala 
770 775 780 

Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr 
785 790 795 800 

Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn 

805 810 815 

Tyr Leu Pro Lys Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro 

820 825 830 

Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He 
835 840 845 

Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser 
850 855 860 

Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin 
865 870 875 880 

Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys 

885 890 895 

Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys 

900 905 910 

Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser 
915 920 925 

Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala 
930 935 940 

Arg Ala Ser Val Ser Val Leu Gly Asp lie Leu Gly Ser Ala Met Gin 
945 950 955 960 

Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn 

965 970 975 

Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu 

980 985 990 
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Thr Gin Gin Leu Thr Pro Glu He Lys Ser Lys Ala He Gly Tyr Leu 
995 1000 1005 

Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser 
1010 1015 1020 

Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He 

1045 1050 1055 

Phe He Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin 

1060 1065 1070 

Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 
1075 1080 1085 

Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 
1090 1095 1100 

He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val 
1105 1110 1115 1120 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 

1125 1130 1135 

Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 

1140 1145 1150 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 
1155 1160 1165 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 
1170 1175 1180 

Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 
1185 1190 1195 1200 

Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 

1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 

1220 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 
1235 1240 1245 

Ser Ser Thr Gin His Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 
1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 

1285 1290 1295 
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Arq Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr 

1300 1305 1310 

Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 
1315 1320 1325 

Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 
1330 1335 1340 

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 1360 

Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 

1365 1370 1375 

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu 

1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 
1395 1400 1405 

Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn 
1410 1415 1420 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 1440 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 

1445 1450 1455 

Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 

1460 1465 1470 

Asn Ala 



(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: Y 

(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 29.. 4480 
(D) OTHER INFORMATION: 
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(ix) FEATURE: 

(A) NAME/KEY: insertionseq 

(B) LOCATION: 2102.. 2305 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GTCTCCTCCA GCTCCTTCTT TCTGCAAC ATG GGG AAG AAC AAA CTC CTT CAT 

Met Gly Lys Asn Lys Leu Leu His 
1 5 

CCA AGT CTG GTT CTT CTC CTC TTG GTC CTC CTG CCC ACA GAC GCC TCA 1 
Pro Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro Thr Asp Ala Ser 
10 15 20 

GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC TCC CTG CTC CAC 1 
Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu His 
25 30 35 40 

ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC TAC CTG AAT GAG 1 
Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn Glu 

45 50 55 

ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG GGA AAC AGG AGC 2 
Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg Ser 

60 65 70 

CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC CAC TGT GTC GCC 2 
Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val Ala 
75 80 85 

TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA ATG TTC CTC ACT 3 
Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu Thr 
90 95 100 

GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG CGG ACC ACA GTG 3 
Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr Val 
105 110 115 120 

ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG ACA GAC AAA TCA 4 
Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys Ser 

125 130 135 

ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT GTC TCC ATG GAT 4 
He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met Asp 

140 145 150 

GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA GTA TAC ATT CAG 5 
Glu Asn Phe His Pro Leu Asn Glu Leu He Pro Leu Val Tyr He Gin 
155 160 165 

GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT TTC CAG TTA GAG 5 
Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser Phe Gin Leu Glu 
170 175 180 
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GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA GAG CCC TTC CAG 628 
Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe Gin 
185 190 195 200 

GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT GGA AGG ACA GAG 676 
Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr Glu 

205 210 215 

CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG TTT GAA GTA CAA 724 
His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val Gin 

220 225 230 

GTA ACA GTG CCA AAG ATA ATC ACC ATC TTG GAA GAA GAG ATG AAT GTA 772 
Val Thr Val Pro Lys lie He Thr He Leu Glu Glu Glu Met Asn Val 
235 240 245 

TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC CCT GGA CAT GTG 820 
Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His Val 
250 255 260 

ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC GAC TGC CAC GGT 868 
Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His Gly 
265 270 275 280 

GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA CAG CTA AAC AGC 916 
Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn Ser 

285 290 295 

CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC TTC CAG CTG AAG 964 
His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu Lys 

300 305 310 

AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC CAG ATC CAA GAA 1012 
Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin Glu 
315 320 325 

GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC AGT GAA ATC ACA 1060 
Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He Thr 
330 335 340 

AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC TCA CAC TTT CGA 1108 
Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe Arg 
345 350 355 360 

CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA GAT GGG AAA GGC 1156 
Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys Gly 

365 370 375 

GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA AAT GAA GCA AAC 1204 
Val Pro lie Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala Asn 

380 385 390 

TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT GTA CAG TTC TCT 1252 
Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe Ser 
395 400 405 
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ATC AAC ACC ACC AAT GTT ATG GGT ACC TCT CTT ACT GTT AGG GTC AAT 1300 

He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val Asn 

410 415 420 

TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG GTG TCA GAA GAA 1348 

Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu Glu 

425 430 435 440 

CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC TCC CCA AGC AAG 1396 

His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser Lys 

445 450 455 

AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA CCC TGT GGC CAT 1444 

Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly His 

460 465 470 

ACT CAG ACA GTC CAG GCA CAT TAT ATT CTG AAT GGA GGC ACC CTG CTG 1492 

Thr Gin Thr Val Gin Ala His Tyr lie Leu Asn Gly Gly Thr Leu Leu 

475 480 485 

GGG CTG AAG AAG CTC TCC TTC TAT TAT CTG ATA ATG GCA AAG GGA GGC 1540 

Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly Gly 

490 495 500 

ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG CAG GAA GAC ATG 1588 

He Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gin Glu Asp Met 

505 510 515 520 

AAG GGC CAT TTT TCC ATC TCA ATC CCT GTG AAG TCA GAC ATT GCT CCT 1636 

Lys Gly His Phe Ser He Ser He Pro Val Lys Ser Asp He Ala Pro 

525 530 535 

GTC GCT CGG TTG CTC ATC TAT GCT GTT TTA CCT ACC GGG GAC GTG ATT 1684 

Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val He 

540 545 550 

GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG GCC AAC AAG GTG 1732 

Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys Val 

555 560 565 

GAT TTG AGC TTC AGC CCA TCA CAA AGT CTC CCA GCC TCA CAC GCC CAC 1780 

Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala His 

570 575 580 

CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC CTC CGT GCT GTG 1828 

Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala Val 

585 590 595 600 

GAC CAA AGC GTG CTG CTC ATG AAG CCT GAT GCT GAG CTC TCG GCG TCC 1876 

Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala Ser 

605 610 615 

TCG GTT TAC AAC CTG CTA CCA GAA AAG GAC CTC ACT GGC TTC CCT GGG 1924 

Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro Gly 

620 625 630 
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CCT TTG AAT GAC CAG GAC GAT GAA GAC TGC ATC AAT CGT CAT AAT GTC 
Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys He Asn Arg His Asn Val 
635 640 645 

TAT ATT AAT GGA ATC ACA TAT ACT CCA GTA TCA AGT ACA AAT GAA AAG 
Tyr He Asn Gly He Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu Lys 
650 655 660 

GAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG GCA TTC ACC AAC 
Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr Asn 
665 670 675 680 

TCA AAG ATT CGT AAA CCC AAA ATG TGT CCA CAG CTG CAG TCA GTG TCA 
Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu Gin Ser Val Ser 

685 690 695 

GCC GGC GCC GTG GGA CAG GGA TAT TAT GGA GCC GGA CTG GGA GTG GTG 
Ala Gly Ala Val Gly Gin Gly Tyr Tyr Gly Ala Gly Leu Gly Val Val 

700 705 710 

GAG AGG CCT TAT GTG CCT CAG CTG GGT ACC TAT AAT GTG ATC CCT CTG 
Glu Arg Pro Tyr Val Pro Gin Leu Gly Thr Tyr Asn Val He Pro Leu 
715 720 725 

AAT AAT GAG CAG AGC TCA GGA CCT GTG CCT GAG ACA GTG AGG AAG TAT 
Asn Asn Glu Gin Ser Ser Gly Pro Val Pro Glu Thr Val Arg Lys Tyr 
730 735 740 

TTC CCT GAG ACA TGG ATC TGG GAT CTG GTG GTG GTG AAT TCC GCG GGT 
Phe Pro Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala Gly 
745 750 755 760 

GTG GCT GAG GTA GGA GTA ACA GTC CCT GAC ACC ATC ACC GAG TGG AAG 
Val Ala Glu Val Gly Val Thr Val Pro Asp Thr lie Thr Glu Trp Lys 

765 770 775 

GCA GGG GCC TTC TGC CTG TCT GAA GAT GCT GGA CTT GGT ATC TCT TCC 
Ala Gly Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly lie Ser Ser 

780 785 790 

ACT GCC TCT CTC CGA GCC TTC CAG CCC TTC TTT GTG GAG CTC ACA ATG 
Thr Ala Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met 
795 800 805 

CCT TAC TCT GTG ATT CGT GGA GAG GCC TTC ACA CTC AAG GCC ACG GTC 
Pro Tyr Ser Val lie Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val 
810 815 820 

CTA AAC TAC CTT CCC AAA TGC ATC CGG GTC AGT GTG CAG CTG GAA GCC 
Leu Asn Tyr Leu Pro Lys Cys He Arg Val Ser Val Gin Leu Glu Ala 
825 830 835 840 

TCT CCC GCC TTC CTA GCT GTC CCA GTG GAG AAG GAA CAA GCG CCT CAC 
Ser Pro Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His 

845 850 855 



1972 



2020 



2068 



2116 



2164 



2212 



2260 



2308 



2356 



2404 



2452 



2500 



2548 



2596 
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TGC ATC TGT GCA AAC GGG CGG CAA ACT GTG TCC TGG GCA GTA ACC CCA 2644 
Cys He Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro 

860 865 870 

AAG TCA TTA GGA AAT GTG AAT TTC ACT GTG AGC GCA GAG GCA CTA GAG 2692 
Lys Ser Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu 
875 880 885 

TCT CAA GAG CTG TGT GGG ACT GAG GTG CCT TCA GTT CCT GAA CAC GGA 2740 
Ser Gin Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly 
890 895 900 

AGG AAA GAC ACA GTC ATC AAG CCT CTG TTG GTT GAA CCT GAA GGA CTA 2788 
Arg Lys Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu 
905 910 915 920 

GAG AAG GAA ACA ACA TTC AAC TCC CTA CTT TGT CCA TCA GGT GGT GAG 2836 
Glu Lys Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu 

925 930 935 

GTT TCT GAA GAA TTA TCC CTG AAA CTG CCA CCA AAT GTG GTA GAA GAA 2884 
Val Ser Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu 

940 945 950 

TCT GCC CGA GCT TCT GTC TCA GTT TTG GGA GAC ATA TTA GGC TCT GCC 2932 
Ser Ala Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala 
955 960 965 

ATG CAA AAC ACA CAA AAT CTT CTC CAG ATG CCC TAT GGC TGT GGA GAG 2980 
Met Gin Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu 
970 975 980 

CAG AAT ATG GTC CTC TTT GCT CCT AAC ATC TAT GTA CTG GAT TAT CTA 3028 
Gin Asn Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu 
985 990 995 1000 

AAT GAA ACA CAG CAG CTT ACT CCA GAG ATC AAG TCC AAG GCC ATT GGC 3076 
Asn Glu Thr Gin Gin Leu Thr Pro Glu He Lys Ser Lys Ala He Gly 

1005 1010 1015 

TAT CTC AAC ACT GGT TAC CAG AGA CAG TTG AAC TAC AAA CAC TAT GAT 3124 
Tyr Leu Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp 

1020 1025 1030 

GGC TCC TAC AGC ACC TTT GGG GAG CGA TAT GGC AGG AAC CAG GGC AAC 3172 
Gly Ser Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn 
1035 1040 1045 

ACC TGG CTC ACA GCC TTT GTT CTG AAG ACT TTT GCC CAA GCT CGA GCC 3220 
Thr Trp Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala 
1050 1055 1060 

TAC ATC TTC ATC GAT GAA GCA CAC ATT ACC CAA GCC CTC ATA TGG CTC 3268 
Tyr He Phe He Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu 
1065 1070 1075 1080 
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TCC CAG AGG CAG AAG GAC AAT GGC TGT TTC AGG AGC TCT GGG TCA CTG 3316 
Ser Gin Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu 

1085 1090 1095 

CTC AAC AAT GCC ATA AAG GGA GGA GTA GAA GAT GAA GTG ACC CTC TCC 3364 
Leu Asn Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser 

1100 1105 1110 

GCC TAT ATC ACC ATC GCC CTT CTG GAG ATT CCT CTC ACA GTC ACT CAC 3412 
Ala Tyr He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His 
1115 1120 1125 

CCT GTT GTC CGC AAT GCC CTG TTT TGC CTG GAG TCA GCC TGG AAG ACA 3460 
Pro Val Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr 
1130 1135 1140 

GCA CAA GAA GGG GAC CAT GGC AGC CAT GTA TAT ACC AAA GCA CTG CTG 3508 
Ala Gin Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu 
1145 1150 1155 1160 

GCC TAT GCT TTT GCC CTG GCA GGT AAC CAG GAC AAG AGG AAG GAA GTA 3556 
Ala Tyr Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val 

1165 1170 1175 

CTC AAG TCA CTT AAT GAG GAA GCT GTG AAG AAA GAC AAC TCT GTC CAT 3604 
Leu Lys Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His 

1180 1185 1190 

TGG GAG CGC CCT CAG AAA CCC AAG GCA CCA GTG GGG CAT TTT TAC GAA 3652 
Trp Glu Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu 
1195 1200 1205 

CCC CAG GCT CCC TCT GCT GAG GTG GAG ATG ACA TCC TAT GTG CTC CTC 3700 
Pro Gin Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu 
1210 1215 1220 

GCT TAT CTC ACG GCC CAG CCA GCC CCA ACC TCG GAG GAC CTG ACC TCT 3748 
Ala Tyr Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser 
1225 1230 1235 1240 

GCA ACC AAC ATC GTG AAG TGG ATC ACG AAG CAG CAG AAT GCC CAG GGC 3796 
Ala Thr Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly 

1245 1250 1255 

GGT TTC TCC TCC ACC CAG CAC ACA GTG GTG GCT CTC CAT GCT CTG TCC 3844 
Gly Phe Ser Ser Thr Gin His Thr Val Val Ala Leu His Ala Leu Ser 

1260 1265 1270 

AAA TAT GGA GCA GCC ACA TTT ACC AGG ACT GGG AAG GCT GCA CAG GTG 3892 
Lys Tyr Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val 
1275 1280 1285 

ACT ATC CAG TCT TCA GGG ACA TTT TCC AGC AAA TTC CAA GTG GAC AAC 3940 
Thr He Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn 
1290 1295 1300 
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AAC AAC CGC CTG TTA CTG CAG CAG GTC TCA TTG CCA GAG CTG CCT GGG 3988 

Asn Asn Arg Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly 

1305 1310 1315 1320 

GAA TAC AGC ATG AAA GTG ACA GGA GAA GGA TGT GTC TAC CTC CAG ACA 4036 
Glu Tyr Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr 

1325 1330 1335 

TCC TTG AAA TAC AAT ATT CTC CCA GAA AAG GAA GAG TTC CCC TTT GCT 4084 
Ser Leu Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala 

1340 1345 1350 

TTA GGA GTG CAG ACT CTG CCT CAA ACT TGT GAT GAA CCC AAA GCC CAC 4132 
Leu Gly Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His 
1355 1360 1365 

ACC AGC TTC CAA ATC TCC CTA AGT GTC AGT TAC ACA GGG AGC CGC TCT 4180 
Thr Ser Phe Gin lie Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser 
1370 1375 1380 

GCC TCC AAC ATG GCG ATC GTT GAT GTG AAG ATG GTC TCT GGC TTC ATT 4228 
Ala Ser Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He 
1385 1390 1395 1400 

CCC CTG AAG CCA ACA GTG AAA ATG CTT GAA AGA TCT AAC CAT GTG AGC 4276 
Pro Leu Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser 

1405 1410 1415 

CGG ACA GAA GTC AGC AGC AAC CAT GTC TTG ATT TAC CTT GAT AAG GTG 4324 
Arg Thr Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val 

1420 1425 1430 

TCA AAT CAG ACA CTG AGC TTG TTC TTC ACG GTT CTG CAA GAT GTC CCA 4372 
Ser Asn Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro 
1435 1440 1445 

GTA AGA GAT CTG AAA CCA GCC ATA GTG AAA GTC TAT GAT TAC TAC GAG 4420 
Val Arg Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu 
1450 1455 1460 

ACG GAT GAG TTT GCA ATT GCT GAG TAC AAT GCT CCT TGC AGC AAA GAT 4468 
Thr Asp Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp 
1465 1470 1475 1480 

CTT GGA AAT GCT TGAAGACCAC AAGGCTGAAA AGTGCTTTGC TGGAGTCCTG 4520 
Leu Gly Asn Ala 



TTCTCTGAGC TCCACAGAAG ACACGTGTTT TTGTATCTTT AAAGACTTGA TGAATAAACA 4580 
CTTTTTCTGG TCAAAAAAA 4599 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(E) FEATURES: bait region: 690-740 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 
15 10 15 

Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 

20 25 30 

Val Leu Val Pro Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys 
35 40 45 

Val Leu Leu Ser Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 
50 55 60 

Glu Ser Val Arg Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
65 70 75 80 

Asn Asp Val Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 

85 90 95 

Asn Glu Glu Val Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin 

100 105 110 

Glu Phe Lys Lys Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 120 125 

Val Phe Val Gin Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val 
130 135 140 

Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 160 

Leu He Pro Leu Val Tyr He Gin Asp Pro Lys Gly Asn Arg He Ala 

165 170 175 

Gin Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 

180 185 190 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 235 240 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 

245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser He Cys Arg Lys Tyr 

260 265 270 
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Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 
275 280 285 

Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 
290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 

325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 

340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
355 350 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 
370 375 380 

Phe lie Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 
385 390 395 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 

405 410 415 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 

420 425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 
435 440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
450 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 
455 470 475 480 

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 

485 490 495 

Tyr Leu He Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 

500 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 
515 520 525 

Pro Val Lys Ser Asp lie Ala Pro Val Ala Arg Leu Leu He Tyr Ala 
530 535 540 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 

565 570 575 
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Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 

580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
595 600 605 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 640 

Asp Cys He Asn Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr 

645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 

660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met 
675 680 685 

Cys Pro Gin Leu Gin Ser Val Ser Ala Gly Ala Val Gly Gin Gly Tyr 
690 695 700 

Tyr Gly Ala Gly Leu Gly Val Val Glu Arg Pro Tyr Val Pro Gin Leu 
705 710 715 720 

Gly Thr Tyr Asn Val He Pro Leu Asn Asn Glu Gin Ser Ser Gly Pro 

725 730 735 

Val Pro Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp He Trp Asp 

740 745 750 

Leu Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val 
755 760 765 

Pro Asp Thr He Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu 
770 775 780 

Asp Ala Gly Leu Gly He Ser Ser Thr Ala Ser Leu Arg Ala Phe Gin 
785 790 795 800 

Pro Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val He Arg Gly Glu 

805 810 815 

Ala Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys He 

820 825 830 

Arg Val Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro 
835 840 845 

Val Glu Lys Glu Gin Ala Pro His Cys He Cys Ala Asn Gly Arg Gin 
850 855 860 

Thr Val Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val Asn Phe 
865 870 875 880 
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Thr Val Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly Thr Glu 

885 890 895 

Val Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val He Lys Pro 

900 905 910 

Leu Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser 
915 920 925 

Leu Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser Leu Lys 
930 935 940 

Leu Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val 
945 950 955 960 

Leu Gly Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu 

965 970 975 

Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro 

980 985 990 

Asn lie Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro 
995 1000 1005 

Glu He Lys Ser Lys Ala lie Gly Tyr Leu Asn Thr Gly Tyr Gin Arg 
1010 1015 1020 

Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu 
1025 1030 1035 1040 

Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu 

1045 1050 1055 

Lys Thr Phe Ala Gin Ala Arg Ala Tyr lie Phe He Asp Glu Ala His 

1060 1065 1070 

He Thr Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly 
1075 1080 1085 



Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly 

1090 1095 1100 

Val Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr lie Ala Leu Leu 
1105 1110 1115 1120 

Glu He Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe 

1125 1130 1135 

Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser 

1140 1145 1150 

His Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly 
1155 1160 1165 

Asn Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala 

1170 1175 1180 
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Val Lys Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys Pro Lys 
1185 1190 1195 1200 

Ala Pro Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala Glu Val 

1205 1210 1215 

Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala 

1220 1225 1230 

Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp He 
1235 1240 1245 

Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin His Thr 
1250 1255 1260 

Val Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr 
1265 1270 1275 1280 

Arg Thr Gly Lys Ala Ala Gin Val Thr He Gin Ser Ser Gly Thr Phe 

1285 1290 1295 

Ser Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin 

1300 1305 1310 

Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly 
1315 1320 1325 

Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He Leu Pro 
1330 1335 1340 

Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu Pro Gin 
1345 1350 1355 1360 

Thr Cys Asp Glu Pro Lys Ala His Thr Ser Phe Gin lie Ser Leu Ser 

1355 1370 1375 

Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He Val Asp 

1380 1385 1390 

Val Lys Met Val Ser Gly Phe He Pro Leu Lys Pro Thr Val Lys Met 
1395 1400 1405 

Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser Asn His 
1410 1415 1420 

Val Leu He Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser Leu Phe 
1425 1430 1435 1440 

Phe Thr Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro Ala He 

1445 1450 1455 

Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala lie Ala Glu 

1460 1465 1470 

Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1475 1480 
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PATENT CLAIMS 

1- A process for the production of recombinant or-macroglobulin, 

variants, fragments or derivatives thereof, wherein a functionally operative 
expression vector comprising a gene encoding for the expression of a- 
5 macroglobulin, variants, fragments or derivatives thereof, or alleles of 
such a gene, is introduced into a suitable host capable of expressing said 
gene, said host is cultured in a suitable nutrient medium containing sources 
of assimilable carbon and nitrogen and other essential nutrients, and the 
expressed a-macroglobul in or fragments or derivatives thereof is recovered. 

10 

2. The process of claim 1, wherein said gene encodes for the 

expression of human a 2 -macroglobul in, variants, fragments or derivatives 
thereof. 



15 3 * The process of claim 2, wherein said gene encodes for the 

expression of human a 2 -macroglobul in having the amino acid sequence of SEQ 
ID N0:2, or a fragment or derivative thereof. 

4 - The process of claim 2 or 3, wherein said gene comprises the DNA 
20 sequence of SEQ ID N0:1, or a fragment thereof. 

5 * The process of claim 1 or 2, wherein said gene encodes for a 

variant a-macroglobul in, in which the amino acid sequence of the bait region 
has been altered. 

25 

5 - The process of claim 5, wherein the bait region has been altered 
by incorporation of further proteinase target sites. 

7 - The process of claim 5, wherein the bait region has been altered 
30 by removal of proteinase target sites. 

8 - The process of claim 5, wherein the bait region has been altered 
by replacing one or more specific proteinase target sites with one or more 
other specific proteinase target sites. 

35 

9 - The process of claim 8, wherein said proteinase target sites are 
specific for bovine trypsin, Streptomvces qriseus trypsin, papain, porcine 
elastase, bovine chymosin, bovine chymotrypsin, Staphylococcus aureus strain 
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V8 proteinase, human plasmin, bovine thrombin, thermolysin, subtil isin Novo 
and/or Streptomvces griseus proteinase B. 

10. The process of claim 5, wherein wherein the bait region has been 
5 altered by replacing said bait region or part thereof with a bait region or 

a part thereof from another a-macroglobulin. 

11. The process of claim 10, wherein said bait regions originate from 
human a 2 M, Pregnancy Zone Protein (PZP), rat g^M, rat or^, rat aj 3 variant 

10 1, or rat a,I 3 variant 2 (aJa = a r inhibitor 3), especially PZP. 

12. The process of any of claims 5 to 11, wherein said gene encodes 
for the expression of human a a 2 -macroglobul in variant having the amino acid 
sequence of SEQ ID N0:4, or a fragment or derivative thereof. 

15 

13. The process of any of claims 5 to 12, wherein said gene comprises 
the DNA sequence of SEQ ID N0:3, or a fragment thereof. 

14. The process of any of the claims 1 to 13, wherein said gene is 
20 a synthetic gene. 

15. The process of any of the claims 1 to 14, wherein said a- 
macroglobulin, variant, fragment or derivative thereof is co-expressed with 
a desired gene product. 

25 

16. The process of any of the claims 1 to 15, wherein said gene is, 
or is derived from, a human gene. 

17. The process of any of the claims 1 to 16, wherein said host is 
30 a bacterial strain, a fungal strain, a mammalian cell line, or a mammal. 

18. The process of claim 17, wherein said host is a fungus. 

19. The process of claim 18, wherein said fungus belongs to the genus 
35 Aspergillus . 

20 The process of claim 18, wherein said host is a yeast. 
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21. The process of claim 20, wherein said yeast belongs to the genus 
Saccharomvces . 

22. The process of claim 17, wherein said host is a mammalian cell 
5 line. 

23. The process of claim 22, wherein said mammalian cell line is a 
Syrian Baby Hamster Kidney (BKH) cell line. 

10 24. The process of claim 23, wherein said cell line is available from 

ATCC under No. CRL 1632. 

25. A DNA sequence comprising a gene encoding for the expression of 

an a-macroglobul in, variants, fragments or derivatives thereof. 

15 

26 The DNA sequence of claim 25, wherein said gene encodes for human 

c*2-niacroglobul in. 

27. The DNA sequence of claim 25, wherein said gene encodes for the amino 
20 acid sequence of SEQ ID N0:2 or a fragment or derivative thereof. 

28. The DNA sequence of claim 26 or 27, wherein said gene has the 
nucleotide sequence of SEQ ID N0:1 or a fragment thereof. 

25 29. The DNA sequence of claim 25 or 26, wherein said gene encodes 

for a variant a-macroglobul in, in which the amino acid sequence of the bait 
region has been altered. 

30. The DNA sequence of claim 29, wherein said bait region has been 
30 altered by incorporation of further proteinase target sites. 

31. The DNA sequence of claim 29, wherein said bait region has been 
altered by removal of proteinase target sites. 

35 32. The DNA sequence of claim 29, wherein said bait region has been 

altered by replacing one or more specific proteinase target sites with one 
or more other specific proteinase target sites. 
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33. The DNA sequence of claim 29, wherein, wherein said proteinase 
target sites are specific for bovine trypsin, Streptomvces griseus trypsin, 
papain, porcine elastase, bovine chymosin, bovine chymotrypsin, Staphylococ- 
cus aureus strain V8 proteinase, human plasmin, bovine thrombin, thermoly- 

5 sin, subtilisin Novo and/or Streptomvces griseus proteinase B. 

34. The DNA sequence of claim 29, wherein the bait region has been 
altered by replacing said bait region or part thereof with a bait region or 
a part thereof from another a-macroglobul in. 

10 

35. The DNA sequence of claim 34, wherein said bait region originates 
from human <* 2 M, Pregnancy Zone Protein (PZP), rat a,M, rat rat aj 3 
variant 1, or rat ajg variant 2, especially PZP. 

15 36. A functionally operative expression vector comprising a gene in 

accordance with any of the claims 25 to 35 for the expression of human a 2 - 
macroglobul in, variants, fragments or derivatives thereof, or alleles of 
such a gene. 

20 37. The vector of claim 36, further comprising regulatory elements 

necessary for the stable maintenance of said vector in mammalian cells. 

38. The vector of claim 36 or 37, further comprising sequences 
providing for the processing and secretion of the expressed product. 

25 

39. The vector of any of the claims 36 to 38, further comprising one 
or more other genes encoding for a desired gene product. 

40. A functionally operative expression vector comprising a gene 
30 encoding for the expression of an a-macroglobul in, variants, fragments or 

derivatives thereof, or alleles of such a gene, essentially as described. 

41. A transformed host comprising a functionally operative expression 
vector comprising a gene encoding for the expression of human o; 2 -macro- 

35 globulin or fragments or derivatives thereof, or alleles of such a gene. 

42. The host of claim 41, wherein said vector is the vector of any 
of the claims 36 to 40. 



WO 91/03557 PCT/DK90/00225 

63 

43. The host of claim 41 or 42, wherein said host is a bacterial 
strain, a fungal strain, a mammalian cell line, or a mammal. 

44. The host of claim 43, wherein said host is a fungus. 

5 

45. The host of claim 44, wherein said fungus belongs to the genus 
Aspergillus . 

46. The host of claim 44, wherein said host is a yeast. 

10 

47. The host of claim 46, wherein said host belongs to the genus Sac- 
charomvces . 

48. The host of claim 43, wherein said host is a mammalian cell line. 

15 

49. The host of claim 48, wherein said host is a Syrian Baby Hamster 
Kidney (BHK) cell line. 

50 - The host of claim 49, wherein said cell line is available from 

20 ATCC under No. CRL 1632. 

51 - Recombinant human a 2 -macroglobul in of SEQ ID N0:2 or SEQ ID N0:4 

in an active form. 

25 52- Recombinant a-macroglobul in, variants, fragments or derivatives 

thereof produced by a process of any of the claims 1 to 24. 

53. Recombinant a-macroglobul in, variants, fragments or derivatives 

thereof of claim 52 produced by the use of a vector of any of the claims 36 
30 to 40. 



54. Recombinant a-macroglobul in, variants, fragments or derivatives 

thereof essentially as described. 

35 55. Recombinant human a2-macroglobul in, variants, fragments or 

derivatives thereof essentially as described. 

56. A growth medium comprising one or more a-macroglobul ins . 
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57. A growth medium comprising recombinant a-macroglobulin, variants, 
fragments or derivatives thereof according to any of the claims 51 to 55. 

58. Use of recombinant a-macroglobul in, variants, fragments or 
5 derivatives thereof according to any of the claims 51 to 55 as a protein 

carrier in enzyme replacement therapy. 

59. Use of recombinant a-macroglobul in, variants, fragments or 
derivatives thereof according to any of the claims 51 to 55 as a DNA carrier 

10 in gene therapy. 
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